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5 DESCRIPTION 

TESTIS -SPECIFIC TRANSCRIPTION FACTOR ZGCL-1 

BACKGROUND OF THE INVENTION 

10 Proliferation and differentiation of cells, of 

multicellular organisms are controlled by hormones and 
polypeptide growth factors. These diffusable molecules 
allow cells to communicate with each other and act in 
concert to form cells and organs, and to repair and 

15 regenerate damaged tissue. Examples of hormones and growth 
factors include the steroid hormones (e.g. estrogen, 
testosterone) , parathyroid hormone, follicle stimulating 
hormone, the interleukins , platelet derived growth factor 
(PDGF) , epidermal growth factor (EGF) , granulocyte- 

2 0 macrophage colony stimulating factor (GM-CSF) , 

erythropoietin (EPO) and calcitonin. 

Hormones and growth factors influence cellular 
metabolism by binding to receptors. Receptors may be 
integral membrane proteins that are linked to signaling 

25 pathways within the cell, such as second messenger systems. 
Other classes of receptors are soluble molecules, such as 
the transcription factors. 

Transcription factors regulate the transcription 
of genes in the cell by interacting with other 

30 transcription factors and RNA polymerase. Transcription 
factors are characterized by their DNA-binding domain and 
their transcriptional activation domain. Within the DNA 
binding domain, several different motifs have been 
identified which act to mediate DNA binding to 

35 transcription factors. These include cysteine-histidine 
zinc finger and multi-cysteine zinc finger motifs, homeobox 
motifs, winged helix motifs, leucine -zipper motifs, and 
helix-loop-helix motifs. The activation domain can contain 
a large number of acetic amino acids which form an 
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amphipathic cc-helix with its negative charges displayed on 
one surface. Others have glutamine or proline-rich 
regions . 

Following DNA binding, the transcription factor 
5 interacts with other factors or the RNA polymerase to 
stimulate transcription. Transcription can be blocked by 
molecules which are able to bind to the DNA binding domain 
but do not interact with the transcription domain. These 
repressor molecules prevent positively acting DNA molecules 

10 from binding. 

The location of the transcription factors bound 
by a particular gene control the gene's expression pattern. 
If a gene binds ' a transcription factor which is 
ubiquitously expressed, then the gene expression will be as 

15 well. If the gene binds a transcription factor which is 
synthesized or active only in a limited number of cells, 
gene expression will be more cell specific. Regulation of 
gene expression is for the most part controlled by 
transcription factors. This regulation provides that the 

2 0 correct gene is activated in the appropriate cell at the 
precise time for development. Identification of 

transcription factors and the genes they regulate greatly 
enhances our understanding of cellular development . As a 
result intervention . methods can be developed to alleviate 

25 problems associated with transcription. The present 
invention addresses this need by providing a novel 
transcription factor and related compositions and methods. 
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SUMMARY OF THE INVENTION 

The present invention provides a novel testis 
specific transcription factor and related compositions and 
5 methods . 

Within one aspect is provided an isolated 
polypeptide comprising a sequence of amino acid residues 
that is at least 80% identical in amino acid sequence to 
residues 1-479 of SEQ ID NO: 2, wherein the sequence 

10 comprises a POZ domain corresponding to amino acid residues 
61-178 of SEQ ID NO: 2. Within one embodiment the 
polypeptide is at least 90% identical in amino acid 
sequence to residues 1-479 of SEQ ID NO: 2, wherein the 
sequence comprises a POZ domain corresponding to amino acid 

15 residues 61-178 of SEQ ID NO:2. Within another embodiment 
the polypeptide is covalently linked amino terminally or 
carboxy terminally to a moiety selected from the group 
consisting of affinity tags, toxins, radionucleotides, 
enzymes and f luorophores . 

2 0 Within another aspect is provided an isolated 

polynucleotide encoding a polypeptide comprising a sequence 
of amino acid residues that is at least 80% identical in 
amino acid sequence to residues 1-479 of SEQ ID NO: 2, 
wherein the sequence comprises a POZ domain corresponding 
25 to amino acid residues 61-178 of SEQ ID NO: 2. Within one 
embodiment the polypeptide is at least 90% identical in 
amino acid sequence to residues 1-479 of SEQ ID NO: 2, 
wherein said sequence comprises a POZ domain corresponding 
to amino acid residues 61-178 of SEQ ID NO:2. 

3 0 Within another aspect the polynucleotide 

comprising the sequence of nucleotide 1 to nucleotide 143 7 
of SEQ ID NO:4. 

Within yet another aspect is provided an 
oligonucleotide probe or primer comprising at least 14 
3 5 contiguous nucleotides of a polynucleotide of SEQ ID NO: 4 
or a sequence complementary to SEQ ID NO:4. 



WO 99/09168 



PCT/US98/17243 



4 



Also provided by the invention is an expression 
vector, comprising the following operably linked elements": a 
transcription promoter; a DNA segment encoding a 
polypeptide comprising a sequence of amino acid residues 
5 that is at least 80% identical in amino acid sequence to 
residues 1-479 of SEQ ID NO: 2, wherein the sequence 
comprises a POZ domain corresponding to amino acid residues 
61-178 of SEQ ID NO:2; and a transcription terminator. 
Within one embodiment the DNA segment encodes a polypeptide 

10 that is at least 90% identical in amino acid sequence to 
residues 1-479 of SEQ ID NO: 2, wherein the sequence 
comprises a POZ domain corresponding to amino acid residues 
61-178 of SEQ ID NO:2. Within another embodiment the DNA 
segment encodes a polypeptide covalently linked amino 

15 terminally or carboxy terminally to an affinity tag. 
Within still another embodiment the DNA segment further 
encodes a secretory signal sequence operably linked to the 
polypeptide . 

Within another aspect is provided a cultured cell 
20 into which has been introduced an expression vector 
comprising the following operably linked elements: a 
transcription promoter; a DNA segment encoding a 
polypeptide comprising a sequence of amino acid residues 
that is at least 80% identical in amino acid sequence to 
25 residues 1-479 of SEQ ID NO: 2, wherein the sequence 
comprises a POZ domain corresponding to amino acid residues 
61-178 of SEQ ID NO: 2; and a transcription terminator; 
wherein the cell expresses the polypeptide encoded by the 
DNA segment . 

3 0 Within another aspect is method of producing a 

polypeptide comprising: culturing a cell into which has 
been introduced an expression vector comprising the 
following operably linked elements: a transcription 
promoter; a DNA segment encoding a polypeptide comprising a 

35 sequence of amino acid residues that is at least 80% 
identical in amino acid sequence to residues 1-479 of SEQ 
ID NO: 2, wherein the sequence comprises a POZ domain 
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corresponding to amino acid residues 61-178 of SEQ ID NO: 2; 
and a transcription terminator; whereby the cell expresses 
the polypeptide encoded by the DNA segment; and recovering 
the expressed polypeptide. 
5 Another aspect provided herein is a 

pharmaceutical composition comprising a polypeptide, the 
polypeptide comprising a sequence of amino acid residues 
that is at least 80% identical in amino acid sequence to 
residues 1-479 of SEQ ID NO: 2, wherein the sequence 

10 comprises a POZ domain corresponding to amino acid residues 
61-178 of SEQ ID NO: 2; in combination with a 
pharmaceutically acceptable vehicle. 

Within another aspect is an antibody that 
specifically binds to an epitope of a polypeptide 

15 comprising a sequence of amino acid residues that is at 
least 80% identical in amino acid sequence to residues 1- 
479 of SEQ ID NO: 2, wherein the sequence comprises a POZ 
domain corresponding to amino acid residues 61-178 of SEQ 
ID NO: 2. 

20 Also provided is a binding protein that 

specifically binds to an epitope of a polypeptide 
comprising a sequence of amino acid residues that is at 
least 80% identical in amino acid sequence to residues 1- 
479 of SEQ ID NO: 2, wherein the sequence comprises a POZ 

25 domain corresponding to amino acid residues 61-178 of SEQ 
ID NO: 2. 

Also provided is a method for detecting a genetic 
abnormality in a patient, comprising: obtaining a genetic 
sample from a patient; incubating the genetic sample with a 

3 0 polynucleotide comprising at least 14 contiguous 
nucleotides of SEQ ID NO:l or the complement of SEQ ID 
NO:l, under conditions wherein the polynucleotide will 
hybridize to complementary polynucleotide sequence, to 
produce a first reaction product; comparing the first 

3 5 reaction product to a control reaction product, wherein a 
difference between the first reaction product and the 
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control reaction product is indicative of a genetic 
abnormality in the patient. 

These and other aspects of the invention will 
become evident upon reference to the following detailed 
5 description and the attached drawing. 

BRIEF DESCRIPTION OF THE DRAWING 

The Figure shows a comparison of the deduced 
amino acid sequence of ZGCL-1 (SEQ ID NO: 2) with the 
10 deduced amino acid sequence of the Drosophila gene germ 
cell-less (dGCL-1) (SEQ ID N0:3) . 

DETAILED DESCRIPTION OF THE INVENTION 

Prior to setting forth the invention, it may be 

15 helpful to an understanding thereof to set forth 
definitions of certain terms to be used hereinafter: 

Affinity tag : is used herein to denote a 
polypeptide segment that can be attached to a second 
polypeptide to provide for purification or detection of the 

20 second polypeptide or provide sites for attachment of the 

second polypeptide to a substrate. In principal, any 

peptide or protein for which an antibody or other specific 

binding agent is available can be used as an affinity tag. 

Affinity tags include a poly-histidine tract, protein A 

25 (Nilsson et al . , EMBO J . 4:1075, 1985; Nilsson et al., 
Methods Enzvmol . 198:3, 1991) , glutathione S transferase 

(Smith and Johnson, Gene 67-31, 1988), Glu-Glu affinity tag 
(Grussenmeyer et al . , Proc. Natl. Acad. Sci . USA 82 : 7952-4 , 
1985), substance P, Flag™ peptide (Hopp et al . , 

30 Biotechnology 6:1204-10, 1988), streptavidin binding 
peptide, or other antigenic epitope or binding domain. 
See, in general, Ford et al . , Protein Expression and 
Purification 2: 95-107, 1991. DNAs encoding affinity tags 
are available from commercial suppliers (e.g., Pharmacia 

35 Biotech, Piscataway, NJ) . 

Allelic variant : Any of two or more alternative 
forms of a gene occupying the same chromosomal locus. 
Allelic variation arises naturally through mutation, and 
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may result in phenotypic polymorphism within populations. 

Gene mutations can be silent (i.e., no change in the 

encoded polypeptide) , or may encode polypeptides having 

altered amino acid sequence. The term "allelic variant" is 

5 also used herein to denote a protein encoded by an allelic 

variant of a gene. Also included are the same protein from 

the same species which differs from a reference amino acid 

sequence due to allelic variation. Allelic variation 

refers to naturally occurring differences among individuals 

10 in genes encoding a given protein. 

Amino - terminal and carboxvl - terminal : are used 

herein to denote positions within polypeptides and 

proteins. Where the context allows, these terms are used 

with reference to a particular sequence or portion of a 

15 polypeptide or protein to denote proximity or relative 
position. For example, a certain sequence positioned 
carboxyl- terminal to a reference sequence within a protein 
is located proximal to the carboxyl terminus of the 
reference sequence, but is not necessarily at the carboxyl 

20 terminus of the complete protein. 

Complement /ant i -complement pair : Deno t e s non - 
identical moieties that form a non-covalently associated, 
stable pair under appropriate conditions. For instance, 
biotin and avidin (or streptavidin) are prototypical 

25 members of a complement /ant i- complement pair. Other 
exemplary complement/anti-complement pairs include 
receptor/ligand pairs, antibody /antigen (or hapten or 
epitope) pairs, sense/antisense polynucleotide pairs, and 
the like. Where subsequent dissociation of the 

30 complement /ant i- complement pair is desirable, the 
complement /ant i- complement pair preferably has a binding 
affinity of <10" 9 M. 

Complements of polynucleotide molecules : Denotes 
polynucleotide molecules having a complementary base 

3 5 sequence and reverse orientation as compared to a reference 
sequence. For example, the sequence 5' ATGCACGGG 3 1 is 
complementary to 5 » CCCGTGCAT 3 1 . 
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Contig : Denotes a polynucleotide, a segment of 
which is equivalent in nucleotide sequence to a segment of 
another polynucleotide sequence- A "contig assembly" 
denotes a collection of EST contigs that define a larger 
5 polynucleotide segment containing an open reading frame 
encoding a full-length or partial polypeptide. 

Degenerate : As applied to a nucleotide sequence 
such as a probe or primer, denotes a sequence of 
nucleotides that includes one or more degenerate codons (as 
10 compared to a reference polynucleotide molecule that 
encodes a polypeptide) . Degenerate codons contain 
different triplets of nucleotides, but encode the same 
amino acid residue (i.e., GAU and GAC triplets each encode 
Asp) . 

15 Expression vector : A DNA molecule, linear or 

circular, that comprises a segment encoding a polypeptide 
of interest operably linked to additional segments that 
provide for its transcription. Such additional segments 
may include promoter and terminator sequences, and 

20 optionally one or more origins of replication, one or more 
selectable markers, an enhancer, a polyadenylation signal, 
and the like. Expression vectors are generally derived 
from plasmid or viral DNA, or may contain elements of both. 

Isolated : when applied to a polynucleotide, 

25 denotes that the polynucleotide has been removed from its 
natural genetic milieu and is thus free of other extraneous 
or unwanted coding sequences, and is in a form suitable for 
use within genetically engineered protein production 
systems. Such isolated molecules are those that are 

3 0 separated from their natural environment and include cDNA 
and genomic clones. Isolated DNA molecules of the present 
invention are free of other genes with which they are 
ordinarily associated, but may include naturally occurring 
5 1 and 3 1 untranslated regions such as promoters and 

35 terminators. The identification of associated regions will 
be evident to one of ordinary skill in the art (see for 
example, Dynan and Tijan, Nature 316.: 774-78, 1985) . 



WO 99/09168 



9 



PC17US98/17243 



Isolated polypeptide or protein : is a polypeptide 
or protein that is found in a condition other than 'its 
native environment, such as apart from blood and animal 
tissue. In a preferred form, the isolated polypeptide is 
5 substantially free of other polypeptides, particularly 
other polypeptides of animal origin. It is preferred to 
provide the polypeptides in a highly purified form, i.e. 
greater than 95% pure, more preferably greater than 99% 
pure. When used in this context, the term "isolated" does 

10 not exclude the presence of the same polypeptide in 
alternative physical forms, such as dimers or alternatively 
glycosylated or derivatized forms. 

Qperablv linked : As applied to nucleotide 
segments, the term "operably linked" indicates that the 

15 segments are arranged so that they function in concert for 
their intended purposes, e.g., transcription initiates in 
the promoter and proceeds through the coding segment to the 
terminator . 

Ortholog: denotes a polypeptide or protein 
2 0 obtained from one species that is the functional 
counterpart of a polypeptide or protein from a different 
species. Sequence differences among orthologs are the 
result of speciation. 

Paralocrs : are distinct but structurally related 
25 proteins made by an organism. Paralogs are believed to 
arise through gene duplication. For example, a-globin, (3- 
globin, and myoglobin are paralogs of each other. 

Polynucleotide : is a single- or double -stranded 
polymer of deoxyribonucleotide or ribonucleotide bases read 
30 from the 5' to the 3' end. Polynucleotides include RNA and 

DNA, and may be isolated from natural sources, synthesized 
in vitro, or prepared from a combination of natural and 

synthetic molecules. Sizes of polynucleotides are 

expressed as base pairs (abbreviated "bp"), nucleotides 
35 {"nt"), or kilobases ("kb"). Where the context allows, the 
latter two terms may describe polynucleotides that are 
single-stranded or double- stranded. When the term is 
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applied to double -stranded molecules it is used to denote 
overall length and will be understood to be equivalent" to 
the term "base pairs". It will be recognized by those 
skilled in the art that the two strands of a double - 
5 stranded polynucleotide may differ slightly in length and 
that the ends thereof may be staggered as a result of 
enzymatic cleavage; thus all nucleotides within a double- 
stranded polynucleotide molecule may not be paired. Such 
unpaired ends will in general not exceed 2 0 nt in length. 
. 10 Polypeptide : is a polymer of amino acid residues 

joined by peptide bonds, whether produced naturally or 
synthetically. Polypeptides of less than about 10 amino 

acid residues are commonly referred to as "peptides". 

Promoter: Denotes a portion of a gene containing 

15 DNA sequences that provide for the binding of RNA 
polymerase and initiation of transcription. Promoter 
sequences are commonly, but not always, found in the 5' 
non-coding regions of genes. 

Protein : is a macromolecule comprising one or 

20 more polypeptide chains. A protein may also comprise non- 
peptidic components, such as carbohydrate groups. 
Carbohydrates and other non-peptidic substituents may be 
added to a protein by the cell in which the protein is 
produced, and will vary with the type of cell. Proteins 

25 are defined herein in terms of their amino acid backbone 
structures; substituents such as carbohydrate groups are 
generally not specified, but may be present nonetheless. 

Receptor : A cell-associated protein, or a 
polypeptide subunit of such protein, that binds to a 

3 0 bioactive molecule (the "ligand") and mediates the effect 
of the ligand on the cell. Binding of ligand to receptor 
results in a change in the receptor (and, in some cases, 
receptor multimerization, i.e., association of identical or 
different receptor subunits) that causes interactions 

35 between the effector domain (s) of the receptor and other 
molecule (s) in the cell. These interactions in turn lead 
to alterations in the metabolism of the cell. Metabolic 
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events that are linked to receptor- ligand interactions 
include gene transcription, phosphorylation, 
dephosphorylation, cell proliferation, increases in cyclic 
AMP production, mobilization of cellular calcium, 
5 mobilization of membrane lipids, cell adhesion, hydrolysis 
of inositol lipids and hydrolysis of phospholipids. In 
general, receptors can be membrane bound, cytosolic or 
nuclear; monomeric (e.g., thyroid stimulating hormone 
receptor, beta-adrenergic receptor) or multimeric (e.g., 

10 PDGF receptor, growth hormone receptor, IL-3 receptor, GM- 
CSF receptor, G-CSF receptor, erythropoietin receptor and 
IL-6 receptor) . 

Secretory signal sequence : A DNA sequence that 
encodes a polypeptide (a "secretory peptide") that, as a 

15 component of a larger polypeptide, directs the larger 
polypeptide through a secretory pathway of a cell in which 
it is synthesized. The larger polypeptide is commonly 
cleaved to remove the secretory peptide during transit 
through the secretory pathway. 

20 Soluble receptor or ligand : A receptor or a 

ligand polypeptide that is not bound to a cell membrane. 
Soluble receptors are most commonly ligand-binding receptor 
polypeptides that lack transmembrane and cytoplasmic 
domains. Soluble ligands are most commonly receptor- 

25 binding polypeptides that lack transmembrane and 
cytoplasmic domains. Soluble receptors or ligands can 
comprise additional amino acid residues, such as affinity 
tags that provide for purification of the polypeptide or 
provide sites for attachment of the polypeptide to a 

3 0 substrate. Many cell -surface receptors and ligands have 
naturally occurring, soluble counterparts that are produced 
by proteolysis or translated from alternatively spliced 
mRNAs. Receptor and ligand polypeptides are said to be 
substantially free of transmembrane and intracellular 

35 polypeptide segments when they lack sufficient portions of 
these segments to provide membrane anchoring or signal 
transduction, respectively. 
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Molecular weights and lengths of polymers 
determined by imprecise analytical methods (e.g., *gel 
electrophoresis) will be understood to be approximate 
values. When such a value is expressed as "about" X or 
5 "approximately" X, the stated value of X will be understood 
to be accurate to ±10%. 

All references cited herein are incorporated by 
reference in their entirety. 

The present invention is based in part upon the 

10 discovery of a novel DNA sequence (SEQ ID NO:l) and 
corresponding polypeptide sequence (SEQ ID NO: 2) which have 
homology to the Drosophila gene "germ cell-less" (Jongens 
et al., Cell 70:569-584, 1992). Analysis of the tissue 
distribution of the mRNA corresponding to this novel DNA 

15 showed selective expression in the testes, suggesting that 
the ligand mediates processes of progenitor cell growth and 
development, such as spermatogenesis, that are unique to 
the testes. The ligand has been designated ZGCL-1. 

Novel ZGCL-1 polynucleotides and polypeptides of 

20 the present invention were initially identified by querying 
an expressed sequence tag (EST) database. Using this 
information, a novel 14 6 9 bp human cDNA fragment (SEQ ID 
NO:l) was obtained. Sequence analysis of a deduced amino 
acid sequence of ZGCL-1, as represented by SEQ ID NO: 2, 

25 indicates the presence of a potential N-glycosylation site 
at amino acid residues 256-259 of SEQ ID NO: 2, as 
predicted by the PROSITE motif "ASN-GLYCOSYLATION" . ZGCL- 
1 also contains 4 potential cAMP- and cGMP- dependent 
protein kinase phosphorylation sites at amino acid residues 

30 2-5, 16-19, 17-20, and 396-399 of SEQ ID NO:2, as predicted 
by the PROSITE motif " CAMP- PHOSPHO- SITE" . ZGCL-1 contains 
5 potential protein kinase C phosphorylation sites as 
predicted by the PROSITE motif "PKC-PHOSPHO-SITE" at amino 
acid residues 9-11, 13-15, 48-50, 57-59, and 103-105 of SEQ 

35 ID NO:2. The proteins of the present invention comprise a 
sequence of amino acid residues that is at least 80% 
identical to SEQ ID NO: 2. Within certain embodiments of 
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the invention, the sequence is at least 90% or 95% 
identical to SEQ ID NO: 2. 

ZGCL-1 also contains a POZ domain, also referred 
to as the ZIN, BTB, or BR-C/TTK domain (Chen et al., Mol . 
5 Cell, Biol . 15:3424-39, 1995; Dong et al . , Proc . Natl. 
Acad. Sci. USA 93:3624-29, 1996; Albagli et al., Biochem. 
Biophvs. Res. Comm . 220 : 911-15 , 1996; Chang et al., Proc . 
Natl . Acad . Sci . USA 93.: 6947-52, 1996 and Kaplan and 
Calame, Nucleic Acid Research 2JL:1108-16, 1997). This 

10 domain is located approximately between amino acid residues 
61-178 of SEQ ID NO : 2 . The POZ domain is found in a number 
of Zinc finger containing proteins, in Drosophila, POZ 
domain are found in Tramtrack (Harrison and Travers, EMBO 
J. 9.: 207-16, 1990), Broad- complex (DiBello et al., Genetics 

15 129:385-97, 1991), and Kelch (Xue and Cooley, Cell 72:681- 
93, 1993). The first two genes are developmental 

regulators, while the last gene codes for an actin binding 
protein. POZ is also found in POX virus genes such as 
Myxoma virus MAPI (Upton et al . , Virology 179:618-31, 1990) 

20 and vaccinia virus protein a55 (Genbank accession # 
P24768) . POZ is also found in a few human genes: KUP 
(Chardin et al., Nucleic Acid Research 19:1431-36, 1991), 
ZID (Bardwell and Treisman, Genes Dev . 8:1664-77, 1994), 
and PLZF (Chen et al . , EMBO J . 12:1161-67, 1993). POZ 

25 domains are thought to mediate protein-protein 
interactions. Most of the characterized POZ domains form 
homomeric interactions, though there are a few examples of 
POZ domains that form heteromeric interactions (Ttk and 
GAGA) . The POZ domain is associated with transcriptional 

30 regulators and proteins that contain DNA binding Zinc 
fingers, however there are some examples of non-zinc-finger 
POZ domain proteins, such as Kelch. Like Kelch, ZGCL-1 
does not contain any zinc- finger motifs and interestingly 
Kelch is involved in oogenesis. Those skilled in the art 

35 will recognize that these domain boundaries are 
approximate, and are based on alignments with known 
proteins and predictions of protein folding. 
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A comparison of the ZGCL-1 deduced amino acid 
sequence (as represented in SEQ ID NO: 2) with the deduced 
amino acid sequence of Drosophila germ cell-less gene (SEQ 
ID NO: 13) is shown in the Figure. ZGCL-1 shares 35% amino 
5 acid identity with the Drosophila "germ cell -less" gene 
(Jongens et al . , Cell 70:569-84, 1992). The DNA sequence as 
represented by SEQ ID NO:l is considered to be the human 
homolog of the Drosophila "germ cell-less" gene. 

Northern blot analysis of various human tissues 

10 was performed using a 200 bp DNA probe (SEQ ID NO: 3) . A 
3.2 kb transcript was detected corresponding to ZGCL-1. A 
high level of transcription was detected in testis and a 
lower level of transcription was detected in thyroid, 
spinal cord, stomach, lymph node and trachea. A second 

15 transcript of 4.5 kb, corresponding to a low level of 
transcription in placenta and pancreas, was al so detected . 

Radiation hybrid mapping is a somatic cell 
genetic technique developed for constructing high- 
resolution, contiguous maps of mammalian chromosomes (Cox 

20 et al., Science 250:245-250, 1990). Partial or full 
knowledge of a gene's sequence allows the designing of PCR 
primers suitable for use with chromosomal radiation hybrid 
mapping panels. Commercially available radiation hybrid 
mapping panels which cover the entire human genome, such as 

2 5 the Stanford G3 RH Panel and the GeneBridge 4 RH Panel 

(Research Genetics, Inc., Huntsville, AL) , are available. 
These panels enable rapid, PCR based, chromosomal 
localizations and ordering of genes, sequence -tagged sites 

(STSs) , and other nonpolymorphic- and polymorphic markers 
30 within a region of interest. This includes establishing 
directly proportional physical distances between newly 
discovered genes of interest and previously mapped markers. 
The precise knowledge of a gene's position can be useful in 
a number of ways including: 1) determining if a sequence is 

3 5 part of an existing contig and obtaining additional 

surrounding genetic sequences in various forms such as 
YAC-, BAC- or cDNA clones, 2) providing a possible 
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candidate gene for an inheritable disease which shows 
linkage to the same chromosomal region, and 3) for cross- 
referencing model organisms such as mouse which may be 
beneficial in helping to determine what function a 
5 particular gene might have. 

Chromosomal localization of ZGCL-1 to 5q35.3 was 
determined using radiation hybrid chimeras. Human 5q34-q35 
genes have been primarily localized to mouse chromosome 11, 
in the A5-B1 and Al-Bl region. The mouse "germ cell 

10 deficient" gene, GCD, locus maps to chromosome 11 in the 
A2-A3 region (Duncan et al . , Mamm . Genome 6.:697-9, 1995). 
Germ cell deficient leads to improper migration and/or 
proliferation of primordial germ cells during embryonic 
development resulting in infertility in the adult mouse. 

15 Mice having this mutation have been hypothesized to be 
animal models for the human reproductive disorders, 
premature ovarian failure and Sertoli cell only syndrome. 

Polynucleotide sequences encoding highly 
conserved amino acids or amino acid domains of ZGCL-1 can 

20 be used as a tool to identify new family members. For 
instance, reverse transcription-polymerase chain reaction 
(RT-PCR) can be used to amplify sequences encoding domains 
or conserved regions, described above and shown in the 
Figure, from RNA obtained from a variety of tissue sources 

25 or cell lines. In particular, highly degenerate primers 
designed from the ZGCL-1 sequences are useful for this 
purpose . 

The present invention also provides 
polynucleotide molecules, including DNA and RNA molecules, 

30 that encode the ZGCL-1 polypeptides disclosed herein. 
Those skilled in the art will readily recognize that, in 
view of the degeneracy of the genetic code, considerable 
sequence variation is possible among these polynucleotide 
molecules. SEQ ID NO: 4 is a degenerate DNA sequence that 

35 encompasses all DNAs that encode the ZGCL-1 polypeptide of 
SEQ ID NO: 2. Those skilled in the art will recognize that 
the degenerate sequence of SEQ ID NO: 4 also provides all 
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RNA sequences encoding SEQ ID NO: 2 by substituting U 
(uracil) for T (thymine) . Thus, ZGCL-1 polypeptxde- 
encoding polynucleotides comprising nucleotide 1 to 
nucleotide 1437 of SEQ ID NO: 4 and their RNA equivalents 
5 are contemplated by the present invention. Table 1 sets 
forth the one- letter codes used within SEQ ID NO: 4 to 
denote degenerate nucleotide positions. "Resolutions" are 
the nucleotides denoted by a code letter. "Complement" 
indicates the code for the complementary nucleotide (s) . 
10 For example, the code Y denotes either C (cysteine) or T, 
and its complement R denotes A (adenosine) or G (guanine) , 
A being complementary to T, and G being complementary to C. 



WO 99/09168 



PCT/US98/17243 



17 
TABLE 1 



Nucleotide Resolution Complement Resolution 



A 


A 


T 


T 


C 


C 


G 


G 


G 


G 


C 


C 


T 


T 


A 


A 


R 


A|G 


Y 


C|T 


Y 


C|T 


R 


A 1 G 


M 


A|C 


K 


G|T 


K 


G|T 


M 


A|C 


S 


C|G 


S 


C|G 


W 


A|T 


W 


A|T 


H 


A|C|T 


D 


A | G | T 


B 


C|G|T 


V 


A|C|G 


V 


A|C|G 


B 


C|G|T 


D 


A|G|T 


H 


A|C|T 


N 


A|C|G|T 


N 


A|C|G|T 



The degenerate codons used in SEQ ID NO: 4, 
5 encompassing all possible codons for a given amino acid, 
are set forth in Table 2 . 
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TABLE 2 



Afni no 
Acid 


une 
Letter 

1 /"N ^4 

Lode 






i o/"i s\v\ f 

Loaons 


Degenerate 
Codon 


Cys 


p 
c 


TbL 


TPT 

ibl 




TP\y 

TGY 


Ser 


p 

s 


AGC 


AGT 


TP A TPP TPP TPT 

TCA TCC TCG TCT 


1 IP K 1 

WSN 


Thr 


T 


ACA 


Apr 

ACC 


APP APT 

ACG ACT 


A Oil 

ACN 


Pro 


P 


pp a 

CCA 


pop 

ccc 


ppp PPT 

CCG CCT 


P P k 1 

CCN 


Ala 


A 

A 


PP a 

GCA 


ppp 

GCC 


PPP PPT 

GCG GCT 


ppk i 
GCN 


Gly 


p 

G 


pp a 

GGA 


GGC 


ppp PPT 

GGG GGT 


OO L 1 

GGN 


A _ 

Asn 


N 


A A P 

AAC 


A A T 

AAT 




A A \ / 

AAY 


A - 

Asp 


D 


O A P 

GAC 


OAT 

GAT 




O A \ / 

GAY 


Glu 


E 


P A A 

GAA 


P A P 

GAG 




GAR 


Gin 


Q 


P A A 

CAA 


P A P 

CAG 




CAR 


1 1 * 
His 


H 


A O 

CAC 


P A T 

CAT 




CAY 


Arg 


R 


AGA 


A PP 

AGG 


PP A PPP POO PPT 

CGA CGC CGG CGT 


L JO 1 1 

MGN 


Lys 


K 


AAA 

AAA 


A A P 

AAG 




A A l™\ 

AAR 


Met 


M 


A TP 

ATG 






A TO 

ATG 


1 le 


T 
I 


ATA 
Al A 


ATP 

AIL 


ATT 
Al 1 


ATH 


Leu 


1 

L 


pt a 

CTA 


pTp 

CTC 


PTP PTT T — r A TTP 

CTG CTT TTA TTG 


YTN 


Val 


i i 

V 


GTA 


GTC 


GTG GTT 


GTN 


Phe 


F 


ttc 


TTT 




TTY 


Tyr 


Y 


TAC 


TAT 




TAY 


Trp 


W 


TGG 






TGG 


Ter 


» 


TAA 


TAG 


TGA 


TRR 


Asn | Asp 


B 








RAY 


Glu|Gln 


Z 








SAR 


Any 


X 








NNN 
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One of ordinary skill in the art will appreciate 
that some ambiguity is introduced in determining" a 
degenerate codon, representative of all possible codons 
encoding each amino acid. For example, the degenerate 
5 codon for serine (WSN) can, in some circumstances, encode 
arginine (AGR) , and the degenerate codon for arginine (MGN) 
can, in some circumstances, encode serine (AGY) . A similar 
relationship exists between codons encoding phenylalanine 
and leucine. Thus, some polynucleotides encompassed by the 

10 degenerate sequence may encode variant amino acid 
sequences, but one of ordinary skill in the art can easily 
identify such variant sequences by reference to the amino 
acid sequence of SEQ ID NO: 2. Variant sequences can be 
readily tested for functionality as described herein. 

15 One of ordinary skill in the art will also 

appreciate that different species can exhibit "preferential 
codon usage." In general, see, Grantham, et al . , Nuc . 
Acids Res . .8:1893-912, 1980; Haas, et al . Curr. Biol . 
6:315-24, 1996; Wain-Hobson, et al . , Gene 13:355-64, 1981; 

20 Grosjean and Fiers, Gene 18:199-209, 1982; Holm, Nuc. Acids 
Res . 14:3075-87, 1986; Ikemura, J. Mol . Biol . 158 :573-97, 
1982. As used herein, the term "preferential codon usage" 
or "preferential codons" is a term of art referring to 
protein translation codons that are most frequently used in 

25 cells of a certain species, thus favoring one or a few 
representatives of the possible codons encoding each amino 
acid (See Table 2) . For example, the amino acid threonine 
(Thr) may be encoded by ACA, ACC, ACG, or ACT, but in 
mammalian cells ACC is the most commonly used codon; in 

30 other species, for example, insect cells, yeast, viruses or 
bacteria, different Thr codons may be preferential. 
Preferential codons for a particular species can be 
introduced into the polynucleotides of the present 
invention by a variety of methods known in the art. 

35 Introduction of preferential codon sequences into 
recombinant DNA can, for example, enhance production of the 
protein by making protein translation more efficient within 
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a particular cell type or species. Therefore, the 
degenerate codon sequence disclosed in SEQ ID NO: 4 serves 
as a template for optimizing expression of polynucleotides 
in various cell types and species commonly used in the art 
5 and disclosed herein. Sequences containing preferential 
codons can be tested and optimized for expression in 
various species, and tested for functionality as disclosed 
herein. 

Within preferred embodiments of the invention, 

10 isolated polynucleotides will hybridize to similar sized 
regions of SEQ ID NO:l, or to a sequence complementary 
thereto, under stringent conditions. In general, stringent 
conditions are selected to be about 5°C lower than the 
thermal melting point (T m ) for the specific sequence at a 

15 defined ionic strength and pH. The T m is the temperature 
(under defined ionic strength and pH) at which 50% of the 
target sequence hybridizes to a perfectly matched probe. 
Typical stringent conditions are those in which the salt 
concentration is up to about 0.03 M at pH 7 and the 

20 temperature is at least about 60°C. As previously noted, 
the isolated polynucleotides of the present invention 
include DNA and RNA. Methods for isolating DNA and RNA are 
well known in the art. It is generally preferred to isolate 
RNA from testis although DNA can also be prepared using RNA 

25 from other tissues or isolated as genomic DNA. Total RNA 
can be prepared using guanidine HCl extraction followed by 
isolation by centrif ugation in a CsCl gradient (Chirgwin et 
al., Biochemistry 18:52-94, 1979). Poly (A) + RNA is 
prepared from total RNA using the method of Aviv and Leder 

30 ( Proc. Natl. Acad. Sci. USA £9:1408-12, 1972) . 
Complementary DNA (cDNA) is prepared from poly (A) + RNA 
using known methods. Polynucleotides encoding ZGCL-1 
polypeptides are then identified and isolated by, for 
example, hybridization or PCR. 

35 Those skilled in the art will recognize that the 

sequence disclosed in SEQ ID N0:1 represents a single 
allele of the human ZGCL-1 gene, and that allelic variation 
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and alternative splicing, "splice variants", are expected 
to exist. Allelic variants of the DNA sequence shown in 
SEQ ID NO:l, including those containing silent mutations 
and those in which mutations result in amino acid sequence 
5 changes, are within the scope of the present invention, as 
are proteins which are allelic variants of SEQ ID NO: 2. 
Splice variant is used herein to denote alternative forms 
of RNA transcribed from a gene. Splice variation arises 
naturally through use of alternative splicing sites within 

10 a transcribed RNA molecule, or less commonly between 
separately transcribed RNA molecules, and may result in 
several mRNAs transcribed from the same gene. Splice 
variants may encode polypeptides having altered amino acid 
sequence. cDNAs generated from alternatively spliced 

15 mRNAs, which retain the properties of the ZGCL-1 
polypeptide are included within the scope to the present 
invention, as are polypeptides encoded by such cDNAs and 
mRNAs. Allelic variants and splice variants of these 
sequences can be cloned by probing cDNA or genomic 

20 libraries from different individuals or tissues according 
to standard procedures known in the art . 

The present invention further provides 
counterpart ligands and polynucleotides from other species 
(orthologs) . These orthologous polynucleotides can by 

25 used, inter alia, to prepare the respective orthologous 
proteins. These species would include, but are not limited 
to, mammalian, avian, amphibian, reptile, fish, insect and 
other vertebrate and invertebrate species . Of particular 
interest are ZGCL-1 ligand polypeptides from other 

30 mammalian species, including murine, porcine, ovine, 
bovine, canine, feline, equine, and other primate ligands. 
Orthologs of human ZGCL-1 can be cloned using information 
and compositions provided by the present invention in 
combination with conventional cloning techniques. For 

35 example, a cDNA can be cloned using mRNA obtained from a 
tissue or cell type that expresses the ligand. Suitable 
sources of mRNA can be identified by probing Northern blots 
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with probes designed from the sequences disclosed herein. 
A library is then prepared from mRNA of a positive tissue 
or cell line. A ZGCL-1- encoding cDNA can then be isolated 
by a variety of methods, such as by probing with a complete 
5 or partial human cDNA or with one or more sets of 
degenerate probes based on the disclosed sequence. A cDNA 
can also be cloned using the polymerase chain reaction 
(PCR) (Mullis, U.S. Patent No. 4,683,202), using primers 
designed from the sequences disclosed herein. Within an 

10 additional method, the cDNA library can be used to 
transform or transfect host cells, and expression of the 
cDNA of interest can be detected with an antibody to ZGCL- 
1. Similar techniques can also be applied to the isolation 
of genomic clones. 

15 The present invention also provides isolated 

ligand polypeptides that are substantially homologous to 
the ligand polypeptide of SEQ ID NO: 2 and its species 
orthologs. By "isolated" is meant a protein or polypeptide 
that is found in a condition other than its native 

20 environment, such as apart from blood and animal tissue. 
In a preferred form, the isolated protein or polypeptide is 
substantially free of other proteins or polypeptides, 
particularly other proteins or polypeptides of animal 
origin. It is preferred to provide the proteins or 

25 polypeptides in a highly purified form, i.e. greater than 
95% pure, more preferably greater than 99% pure. The term 
"substantially homologous" is used herein to denote 
proteins or polypeptides having 50%, preferably 6 0%, more 
preferably at least 80%, sequence identity to the sequence 

3 0 shown in SEQ ID NO: 2 or its species orthologs. Such 
proteins or polypeptides will more preferably be at least 
90% identical, and most preferably 95% or more identical to 
SEQ ID NO: 2 or its species orthologs. Percent sequence 
identity is determined by conventional methods. See, for 

35 example, Altschul et al., Bull. Math. Bio. 48 : 603-16, 1986 
and Henikoff and Henikoff, Proc. Natl. Acad. Sci . USA 
^9:10915-19, 1992. Briefly, two amino acid sequences are 
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aligned to optimize the alignment scores using a gap 
opening penalty of 10, a gap extension penalty of 1, and 
the "blosum 62" scoring matrix of Henikoff and Heriikoff 
( ibid . ) as shown in Table 3 (amino acids are indicated by 
5 the standard one-letter codes) . The percent identity is 
then calculated as: 

Total number of identical matches 

— x 100 

[length of the longer sequence plus the 
10 number of gaps introduced into the longer 

sequence in order to align the two sequences] 
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Sequence identity of polynucleotide molecules is determined 
by similar methods using a ratio as disclosed above. 

Substantially homologous proteins and 

polypeptides are characterized as having one or more amino 
5 acid substitutions, deletions or additions. These changes 
are preferably of a minor nature, that is conservative 
amino acid substitutions (see Table 4) and other 
substitutions that do not significantly affect the folding 
or activity of the protein or polypeptide; small deletions, 

10 typically of one to about 3 0 amino acids; and small amino- 
or carboxyl- terminal extensions, such as an amino-terminal 
methionine residue, a small linker peptide of up to about 
20-25 residues, or a small extension that facilitates 
purification, an affinity tag. Polypeptides comprising 

15 affinity tags can further comprise a proteolytic cleavage 
site between the ZGCL-1 polypeptide and the affinity tag. 
Preferred such sites include thrombin cleavage sites and 
factor Xa cleavage sites. 
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Table 4 

Conservative amino acid substitutions 

5 Basic: arginine 

lysine 

histidine 
Acidic: glutamic acid 

aspartic acid 
10 Polar: glutamine 

asparagine 
Hydrophobic : leucine 

isoleucine 

valine 

15 Aromatic: phenylalanine 

tryptophan 
tyrosine 
Small: glycine 

alanine 

2 0 serine 

threonine 
methionine 



The proteins of the present invention can also 

25 comprise non-naturally occurring amino acid residues. Non- 

naturally occurring amino acids include, without 
limitation, trans- 3 -methylproline, 2 , 4-methanoproline , cis- 

4-hydroxyproline , trans- 4 -hydroxyproline , N- methyl -glycine , 

alio- threonine , methyl threonine, hydroxyethyl- cysteine, 

3 0 hydroxyethylhomocysteine , nitroglutamine , homo -glutamine , 

pipecolic acid, thiazolidine carboxylic acid, 

dehydroproline, 3- and 4 -methylproline, 3 , 3-dimethyl- 
proline, tert-leucine, norvaline, 2-azaphenylalanine, 3- 

azaphenylalanine , 4 -azaphenylalanine , and 4 - f luoro- 

35 phenylalanine. Several methods are known in the art for 

incorporating non-naturally occurring amino acid residues 
into proteins. For example, an in vitro system can be 
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employed wherein nonsense mutations are suppressed using 
chemically aminoacylated suppressor tRNAs . Methods 'for 
synthesizing amino acids and aminoacylating tRNA are known 
in the art. Transcription and translation of plasmids 
5 containing nonsense mutations is carried out in a cell -free 
system comprising an E. coli S3 0 extract and commercially 

available enzymes and other reagents. Proteins are 

purified by chromatography. See, for example, Robertson et 

al., J. Am. Chem. Soc . 113 : 2722 , 1991; Ellman et al., 

10 Methods Enzvmol . 202:301, 1991; Chung et al . , Science 

259:806-9, 1993; and Chung et al., Proc. Natl. Acad. Sci . 

USA ,90:10145-9, 1993). In a second method, translation is 
carried out in Xenopus oocytes by microinjection of mutated 

mRNA and chemically aminoacylated suppressor tRNAs 
15 (Turcatti et al . , J. Biol. Chem. 271 : 19991-8 . 1996). 
Within a third method, E. coli cells are cultured in the 

absence of a natural amino acid that is to be replaced 

(e.g., phenylalanine) and in the presence of the desired 

non-naturally occurring amino acid(s) (e.g., 2- 

20 azaphenylalanine, 3 -azaphenylalanine , 4-azaphenylalanine, 

or 4-f luorophenylalanine) . The non-naturally occurring 

amino acid is incorporated into the protein in place of its 

natural counterpart. See, Koide et al . , Biochem . 33 : 7470- 

6, 1994. Naturally occurring amino acid residues can be 
25 converted to non-naturally occurring species by in vitro 

chemical modification. Chemical modification can be 

combined with site-directed mutagenesis to further expand 
the range of substitutions (Wynn and Richards, Protein Sci. 
2 :395-403, 1993) . 

30 A limited number of non-conservative amino acids, 

amino acids that are not encoded by the genetic code, non- 
naturally occurring amino acids, and unnatural amino acids 
may be substituted for ZGCL-1 amino acid residues. 

Essential amino acids in the polypeptides of the 

35 present invention can be identified according to procedures 
known in the art, such as site-directed mutagenesis or 
alanine -scanning mutagenesis (Cunningham and Wells, Science 
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244 : 1081-5, 1989; Bass et al . , Proc . Natl. Acad. Sci . USA 
88:4498-502, 1991) . In the latter technique, single 
alanine mutations are introduced at every residue in the 
molecule, and the resultant mutant molecules are tested for 
5 biological activity as disclosed below to identify amino 
acid residues that are critical to the activity of the 
molecule. See also, Hilton et al . , J. Biol. Chem . 271:4699- 
708, 1996. Sites of ligand-receptor interaction can also 
be determined by physical analysis of structure, as 

10 determined by such techniques as nuclear magnetic 
resonance, crystallography, electron diffraction or 
photoaf f inity labeling, in conjunction with mutation of 
putative contact site amino acids. See, for example, de 
Vos et al., Science 255:306-12, 1992; Smith et al . , J. Mol . 

15 Biol . 224:899-904, 1992; Wlodaver et al . , FEBS Lett . 
309 : 59-64 , 1992. The identities of essential amino acids 
can also be inferred from analysis of homologies with 
related "germ cell-less" proteins. 

Multiple amino acid substitutions can be made and 

20 tested using known methods of mutagenesis and screening, 
such as those disclosed by Reidhaar-Olson and Sauer 
( Science 241:53-7, 1988) or Bowie and Sauer ( Proc. Natl. 
Acad. Sci. USA 86:2152-6, 1989) . Briefly, these authors 
disclose methods for simultaneously randomizing two or more 

25 positions in a polypeptide, selecting for functional 

polypeptide, and then sequencing the mutagenized 

polypeptides to determine the spectrum of allowable 

substitutions at each position. Other methods that can be 
used include phage display (e.g., Lowtnan et al., Biochem . 

30 30:10832-7, 1991; Ladner et al . , U.S. Patent No. 5,223,409; 
Huse, WIPO Publication WO 92/06204) and region-directed 
mutagenesis (Derbyshire et al . , Gene 46,: 145, 1986; Ner et 
al. , DNA 7:127, 1988) . 

Variants of the disclosed ZGCL-1 DNA and 

3 5 polypeptide sequences can be generated through DNA 
shuffling as disclosed by Stemmer, Nature 370 -.389-91 . 1994, 
Stemmer, Proc. Natl. Acad. Sci. USA 91:10747-51, 1994 and 
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WIPO Publication WO 97/20078. Briefly, variant DNAs are 
generated by in vitro homologous recombination by random 

fragmentation of a parent DNA followed by reassembly using 
PCR, resulting in randomly introduced point mutations. 
5 This technique can be modified by using a family of parent 
DNAs, such as allelic variants or DNAs from different 
species, to introduce additional variability into the 
process. Selection or screening for the desired activity, 
followed by additional iterations of mutagenesis and assay 

10 provides for rapid "evolution" of sequences by selecting 
for desirable mutations while simultaneously selecting 
against detrimental changes. 

Mutagenesis methods as disclosed above can be 
combined with high- throughput screening methods to detect 

15 activity of cloned, mutagenized ligands. Mutagenized DNA 
molecules that encode active ligands or portions thereof 
(e.g., receptor-binding fragments) can be recovered from 
the host cells and rapidly sequenced using modern 
equipment . These methods allow the rapid determination of 

2 0 the importance of individual amino acid residues in a 
polypeptide of interest, and can be applied to polypeptides 
of unknown structure. 

Using the methods discussed above, one of 
ordinary skill in the art can identify and/or prepare a 

25 variety of polypeptides that are substantially homologous 
to SEQ ID NO: 2 or allelic variants thereof and retain the 
transcription mediating properties of the wild- type 
protein. Such polypeptides may include affinity tags and 
the like. Such polypeptides may also include additional 

30 polypeptide segments as generally disclosed above. 

The ZGCL-1 polypeptides of the present invention, 
including full-length polypeptides, fragments (e.g., DNA- 
binding fragments) , and fusion polypeptides, can be 
produced in genetically engineered host cells according to 

35 conventional techniques. Suitable host cells are those 
cell types that can be transformed or transfected with 
exogenous DNA and grown in culture, and include bacteria, 
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fungal cells, and cultured higher eukaryotic cells. 
Eukaryotic cells, particularly cultured cells " of 
multicellular organisms, are preferred. Techniques for 
manipulating cloned DNA molecules and introducing exogenous 
5 DNA into a variety of host cells are disclosed by Sambrook 
et al., Molecular Cloning : A Laboratory Manual , Second 
Edition , Cold Spring Harbor, NY, 1989/ and Ausubel et al . , 
eds., Current Protocols in Molecular Biology , John Wiley 
and Sons, Inc., NY, 1987. 

10 In general, a DNA sequence encoding a 2GCL-1 

polypeptide is operably linked to other genetic elements 
required for its expression, generally including a 
transcription promoter and terminator, within an expression 
vector. The vector will also commonly contain one or more 

15 selectable markers and one or more origins of replication, 
although those skilled in the art will recognize that 
within certain systems selectable markers may be provided 
on separate vectors, and replication of the exogenous DNA 
may be provided by integration into the host cell genome. 

2 0 Selection of promoters, terminators, selectable markers, 
vectors and other elements is a matter of routine design 
within the level of ordinary skill in the art. Many such 
elements are described in the literature and are available 
through commercial suppliers. 

2 5 To direct a ZGCL-1 polypeptide into the secretory 

pathway of a host cell, a secretory signal sequence (also 
known as a signal sequence, leader sequence, prepro 
sequence or pre sequence) is provided in the expression 
vector. The secretory signal sequence may be that of the 

3 0 ZGCL-1 polypeptide, or may be derived from another secreted 

protein (e.g., t-PA) or synthesized de novo. The secretory 
signal sequence is joined to the ZGCL-1 DNA sequence in the 
correct reading frame. Secretory signal sequences are 
commonly positioned 5' to the DNA sequence encoding the 
3 5 polypeptide of interest, although certain secretory signal 
sequences may be positioned elsewhere in the DNA sequence 
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of interest (see, e.g., Welch et al . , U.S. Patent No. 
5,037,743; Holland et al., U.S. Patent No. 5,143,830). 

Cultured mammalian cells are suitable hosts 
within the present invention. Methods for introducing 
5 exogenous DNA into mammalian host cells include calcium 
phosphate -mediated transfection (Wigler et al . , Cell 
14:725, 1978; Corsaro and Pearson, Somatic Cell Genetics 
7:603, 1981; Graham and Van der Eb, Virology 52:456, 1973), 
electroporation (Neumann et al . , EMBO J . 1:841-45, 1982), 

10 DEAE-dextran mediated transfection (Ausubel et al . , ibid), 
and liposome-mediated transfection (Hawley- Nelson et al . , 
Focus 15:73, 1993; Ciccarone et al . , Focus 15:80, 1993). 
The production of recombinant polypeptides in cultured 
mammalian cells is disclosed, for example, by Levinson et 

15 al., U.S. Patent No. 4,713,339; Hagen et al . , U.S. Patent 
No. 4,784,950; Palmiter et al . , U.S. Patent No. 4,579,821; 
and Ringold, U.S. Patent No. 4,656,134. Suitable cultured 
mammalian cells include the COS-1 (ATCC No. CRL 1650), COS- 
7 (ATCC No. CRL 1651), BHK (ATCC No. CRL 1632), BHK 570 

20 (ATCC No. CRL 10314), 293 (ATCC No. CRL 1573; Graham et 
al., J. Gen. Virol. 36.: 5 9-72, 19 77) and Chinese hamster 
ovary (e.g., CHO-K1; ATCC No. CCL 61) cell lines. 
Additional suitable cell lines are known in the art and 
available from public depositories such as the American 

25 Type Culture Collection, Rockville, Maryland. In general, 
strong transcription promoters are preferred, such as 
promoters from SV-40 or cytomegalovirus. See, e.g., U.S. 
Patent No. 4,956,288. Other suitable promoters include 
those from metallothionein genes (U.S. Patent Nos. 

30 4,579,821 and 4,601,978) and the adenovirus major late 
promoter . 

Drug selection is generally used to select for 
cultured mammalian cells into which foreign DNA has been 
inserted. Such cells are commonly referred to as 

35 "transf ectants" . Cells that have been cultured in the 
presence of the selective agent and are able to pass the 
gene of interest to their progeny are referred to as 
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"stable transf ectants . " A preferred selectable marker is a 
gene encoding resistance to the antibiotic neomycin. 
Selection is carried out in the presence of a neomycin -type 
drug, such as G-418 or the like. Selection systems may 
5 also be used to increase the expression level of the gene 
of interest, a process referred to as "amplification." 
Amplification is carried out by culturing transf ectants in 
the presence of a low level of the selective agent and then 
increasing the amount of selective agent to select for 

10 cells that produce high levels of the products of the 
introduced genes. A preferred amplifiable selectable 
marker is dihydrof olate reductase, which confers resistance 
to methotrexate. Other drug resistance genes (e.g., 
hygromycin resistance, mult i -drug resistance, puromycin 

15 acetyltransf erase) can also be used. Alternative markers 
that introduce an altered phenotype, such as green 
fluorescent protein, or cell surface proteins such as CD4 , 
CD8, Class I MHC, placental alkaline phosphatase may be 
used to sort transfected cells from untransf ected cells by 

20 such means as FACS sorting or magnetic bead separation 
technology. 

Other higher eukaryotic cells can also be used as 
hosts, including plant cells, insect cells and avian cells. 
The use of Agrobacteriu/n rhizogenes as a vector for 
25 expressing genes in plant cells has been reviewed by Sinkar 
et al., J. Biosci . f Bancralore J 11:47-58, 1987. 

Transformation of insect cells and production of foreign 
polypeptides therein is disclosed by Guarino et al., U.S. 
Patent No. 5,162,222 and WIPO publication WO 94/06463. 

30 Insect cells can be infected with recombinant baculovirus, 
commonly derived from Autographa calif ornica nuclear 
polyhedrosis virus (AcNPV) . See, King and Possee, The 
Baculovirus Expression System: A Laboratory Guide , London, 
Chapman & Hall; O'Reilly et al., Baculovirus Expression 

35 Vectors: A Laboratory Manual , New York, Oxford University 
Press., 1994; and, Richardson, C. D. , Ed., Baculovirus 
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Expression Protocols. Methods in Molecular Biology , Totowa, 
NJ, Humana Press, 1995. A second method of making 
recombinant ZGCL-1 baculovirus utilizes a transposon-based 
system described by Luckow (Luckow et al., J Virol 67 : 4566- 
5 79, 1993) . This system, which utilizes transfer vectors, 
is sold in the Bac-to-Bac™ kit (Life Technologies, 
Rockville, MD) . This system utilizes a transfer vector, 
pFastBacl™ (Life Technologies) containing a Tn7 transposon 
to move the DNA encoding the ZGCL-1 polypeptide into a 
10 baculovirus genome maintained in E. coli as a large plasmid 

called a "bacmid." See, Hill-Perkins and Possee, J. Gen. 
Virol. 71:971-6, 1990; Bonning et al., J. Gen. Virol. 
7_5: 1551-6, 1994; and, Chazenbalk and Rapoport., J. Biol. 
Chem. 270 : 1543-9, 1995. In addition, transfer vectors can 

15 include an in-frame fusion with DNA encoding an epitope tag 
at the C- or N-terminus of the expressed ZGCL-1 
polypeptide, for example, a Glu-Glu epitope tag 
(Grussenmeyer et al., Proc. Natl. Acad. Sci. 82 : 7952-4, 
1985) . Using a technique known in the art, a transfer 

20 vector containing ZGCL-1 is transformed into E. coli, and 
screened for bacmids which contain an interrupted lacZ gene 
indicative of recombinant baculovirus. The bacmid DNA 
containing the recombinant baculovirus genome is isolated, 
using common techniques, and used to transfect Spodoptera 

25 frugiperda cells, e.g. Sf9 cells. Recombinant virus that 
expresses ZGCL-1 is subsequently produced. Recombinant 
viral stocks are made by methods commonly used the art. 

The recombinant virus is used to infect host 
cells, typically a cell line derived from the fall 

30 armyworm, Spodoptera frugiperda. See, in general, Glick 
and Pasternak, Molecular Biotechnology : Principles and 
Applications of Recombinant DNA , ASM Press, Washington, 
D.C., 1994. Another suitable cell line is the High FiveO™ 
cell line (Invitrogen) derived from Trichoplusia ni (U.S. 
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Patent #5,300,435). Commercially available serum-free 
media are used to grow and maintain the cells. Suitable 
media are Sf900 II™ (Life Technologies) or ESF 921™ 
(Expression Systems) for the Sf9 cells; and Ex-cellO405™ 
5 (JRH Biosciences, Lenexa, KS) or Express FiveO™ (Life 
Technologies) for the T. ni cells. The cells are grown up 
from an inoculation density of approximately 2-5 x 10 5 
cells to a density of 1-2 x 10 6 cells at which time a 
recombinant viral stock is added at a multiplicity of 

10 infection (MOI) of - 0.1 to 10, more typically near 3. 
Procedures used are generally described in available 
laboratory manuals (King and Possee, ibid . ; O'Reilly et 
al., ibid . ; Richardson, ibid.). Subsequent purification of 
the ZGCL-1 polypeptide from the supernatant can be achieved 

15 using methods described herein. 

Fungal cells, including yeast cells, can also be 

used within the present invention. Yeast species of 
particular interest in this regard include Saccharomyces 

cerevisiae, Pichia pastoris, and Pichia methanolica. 

20 Methods for transforming S. cerevisiae cells with exogenous 

DNA and producing recombinant polypeptides therefrom are 

disclosed by, for example, Kawasaki, U.S. Patent No. 

4,599,311; Kawasaki et al., U.S. Patent No. 4,931,373; 

Brake, U.S. Patent No. 4,870,008; Welch et al . , U.S. Patent 
25 No. 5,03 7,743; and Murray et al., U.S. Patent No. 

4,845,075. Transformed cells are selected by phenotype 
. determined by the selectable marker, commonly drug 

resistance or the ability to grow in the absence of a 

particular nutrient (e.g., leucine). A preferred vector 
30 system for use in Saccharomyces cerevisiae is the POTl 

vector system disclosed by Kawasaki et al. (U.S. Patent No. 
4,931,373), which allows transformed cells to be selected 
by growth in glucose-containing media. Suitable promoters 
and terminators for use in yeast include those from 
35 glycolytic enzyme genes (see, e.g., Kawasaki, U.S. Patent 
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No. 4,599,311; Kingsman et al . , U.S. Patent No. 4,615,974; 

and Bitter, U.S. Patent No. 4,977,092) and alcohol 

dehydrogenase genes. See also U.S. Patents Nos. 4,990,446; 

5,063,154; 5,139,936 and 4,661,454. Transformation systems 

5 for other yeasts, including Hansenula polymorpha , 

Schizosaccharomyces pombe, Kluyveromyces lactis, 

Kluyveromyces fragilis , Ustilago maydis r Pichia pastoris , 

Pichia methanolica , Pichia guillermondii and Candida 

maltosa are known in the art. See, for example, Gleeson et 

10 al., J. Gen. Microbiol . 132 : 3459-65 , 1986 and Cregg, U.S. 
Patent No. 4,882,279. Aspergillus cells may be utilized 

according to the methods of McKnight et al., U.S. Patent 

No. 4,935,349. Methods for transforming Acremonium 

chrysogenum are disclosed by Sumino et al . , U.S. Patent No. 
15 5,162,228. Methods for transforming Neurospora are 

disclosed by Lambowitz, U.S. Patent No. 4,486,533. 

The use of Pichia methanolica as host for the 

production of recombinant proteins is disclosed in WIPO 

Publications WO 97/17450, WO 97/17451, WO 98/02536, and WO 
20 98/02565. DNA molecules for use in transforming P. 

methanolica will commonly be prepared as double -stranded, 

circular plasmids, which are preferably linearized prior to 
transformation. For polypeptide production in P. 

methanolica, it is preferred that the promoter and 

25 terminator in the plasmid be that of a P. methanolica gene, 

such as a P. methanolica alcohol utilization gene (AUG1 or 

AUG2) . Other useful promoters include those of the 

dihydroxyacetone synthase (DHAS) , formate dehydrogenase 

(FMD) , and catalase (CAT) genes. To facilitate integration 

30 of the DNA into the host chromosome, it is preferred to 

have the entire expression segment of the plasmid flanked 

at both ends by host DNA sequences. A preferred selectable 
marker for use in Pichia methanolica is a P. methanolica 

ADE2 gene, which encodes phosphoribosyl-5-aminoimidazole 

35 carboxylase (AIRC; EC 4.1.1.21), which allows ade2 host 

cells to grow in the absence of adenine. For large-scale, 
industrial processes- where it is desirable to minimize the 
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use of methanol, it is preferred to use host cells in which 
both methanol utilization genes {AUG1 and AUG2) 'are 

deleted. For production of secreted proteins, host cells 
deficient in vacuolar protease genes (PEP4 and PRB1) are 

5 preferred. Electroporation is used to facilitate the 

introduction of a plasmid containing DNA encoding a 
polypeptide of interest into P. methanolica cells. It is 

preferred to transform P. methanolica cells by 

electroporation using an exponentially decaying, pulsed 
10 electric field having a field strength of from 2.5 to 4.5 
kV/cm, preferably about 3.75 kV/cm, and a time constant (t) 

of from 1 to 40 milliseconds, most preferably about 20 
milliseconds . 

Prokaryotic host cells, including strains of the 

15 bacteria Escherichia coli, Bacillus and other genera are 
also useful host cells within the present invention. 
Techniques for transforming these hosts and expressing 
foreign DNA sequences cloned therein are well known in the 
art (see, e.g., Sambrook et al . , ibid.). When expressing a 

20 ZGCL-1 polypeptide in bacteria such as E. coli, the 
polypeptide may be retained in the cytoplasm, typically as 
insoluble granules, or may be directed to the periplasmic 
space by a bacterial secretion sequence. In the former 
case, the cells are lysed, and the granules are recovered 

25 and denatured using, for example, guanidine isothiocyanate 
or urea. The denatured polypeptide can then be refolded 
and dimerized by diluting the denaturant, such as by 
dialysis against a solution of urea and a combination of 
reduced and oxidized glutathione, followed by dialysis 

3 0 against a buffered saline solution. In the latter case, 
the polypeptide can be recovered from the periplasmic space 
in a soluble and functional form by disrupting the cells 
(by, for example, sonication or osmotic shock) to release 
the contents of the periplasmic space and recovering the 

35 protein, thereby obviating the need for denaturation and 
refolding. 
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Transformed or transfected host cells are 

cultured according to conventional procedures in a culture 

medium containing nutrients and other components required 

for the growth of the chosen host cells. A variety of 

5 suitable media, including defined media and complex media, 

are known in the art and generally include a carbon source, 

a nitrogen source, essential amino acids, vitamins and 

minerals. Media may also contain such components as growth 

factors or serum, as required. The growth medium will 

10 generally select for cells containing the exogenously added 

DNA by, for example, drug selection or deficiency in an 

essential nutrient which is complemented by the selectable 

marker carried on the expression vector or co-transf ected 
into the host cell. P. methanolica cells are cultured in a 

15 medium comprising adequate sources of carbon, nitrogen and 

trace nutrients at a temperature of about 25°C to 35°C. 

Liquid cultures are provided with sufficient aeration by 

conventional means, such as shaking of small flasks or 
sparging of fermentors. A preferred culture medium for P. 

20 methanolica is YEPD (2% D-glucose, 2% Bacto™ Peptone (Difco 
Laboratories, Detroit, MI) , 1% Bacto™ yeast extract (Difco 
Laboratories), 0.004% adenine and 0.006% L-leucine) . 

Expressed recombinant ZGCL-1 polypeptides (or 
chimeric ZGCL-1 polypeptides) can be purified using 

25 fractionation and/or conventional purification methods and 
media. Ammonium sulfate precipitation and acid or 

chaotrope extraction may be used for fractionation of 
samples. Exemplary purification steps may include 

hydroxyapatite, size exclusion, FPLC and reverse-phase high 

30 performance liquid chromatography. Suitable 

chromatographic media include derivatized dextrans, 
agarose, cellulose, polyacrylamide, specialty silicas, and 
the like. PEI, DEAE, QAE and Q derivatives are preferred. 
Exemplary chromatographic media include those media 

35 derivatized with phenyl, butyl, or octyl groups, such as 
Phenyl -Sepharose FF (Pharmacia) , Toyopearl butyl 650 (Toso 
Haas, Montgomeryville, PA) , Octyl -Sepharose (Pharmacia) and 
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the like; or polyacrylic resins, such as Amberchrom CG 71 
(Toso Haas) and the like. Suitable solid supports include 
glass beads, silica-based resins, cellulosic resins, 
agarose beads, cross -linked agarose beads, polystyrene 
5 beads, cross- linked polyacrylamide resins and the like that 
are insoluble under the conditions in which they are to be 
used. These supports may be modified with reactive groups 
that allow attachment of proteins by amino groups, carboxyl 
groups, sulfhydryl groups, hydroxyl groups and/or 

10 carbohydrate moieties. Examples of coupling chemistries 
include cyanogen bromide activation, N-hydroxysuccinimide 
activation, epoxide activation, sulfhydryl activation, 
hydrazide activation, and carboxyl and amino derivatives 
for carbodiimide coupling chemistries. These and other 

15 solid media are well known and widely used in the art, and 
are available from commercial suppliers. Methods for 
binding receptor polypeptides to support media are well 
known in the art. Selection of a particular method is a 
matter of routine design and is determined in part by the 

20 properties of the chosen support. See, for example, 
Affinity Chromatography: Principles & Methods , Pharmac i a 
LKB Biotechnology, Uppsala, Sweden, 1988. 

The polypeptides of the present invention can be 
isolated by exploitation of their physical properties. For 

25 example, immobilized metal ion adsorption (IMAC) 
chromatography can be used to purify histidine-rich 
proteins and those comprising polyhistidine tags. Briefly, 
a gel is first charged with divalent metal ions to form a 
chelate (Sulkowski, Trends in Biochem. 3.:1~ 7 / 1985). 

30 Histidine-rich proteins will be adsorbed to this matrix 
with differing affinities, depending upon the metal ion 
used, and will be eluted by competitive elution, lowering 
the pH, or use of strong chelating agents. Other methods 
of purification include purification of glycosylated 

35 proteins by lectin affinity chromatography and ion exchange 
chromatography ( Methods in Enzvmol . , Vol. 182, "Guide to 
Protein Purification", M. Deutscher, (ed.), Acad. Press, 
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San Diego, 1990, pp. 529-39). Within additional embodiments 
of the invention, a fusion of the polypeptide of interest 
and an affinity tag (e.g., Glu-Glu affinity tag, FLAG tag, 
maltose-binding protein, an immunoglobulin domain) may be 
5 constructed to facilitate purification. 

Protein refolding (and optionally reoxidation) 
procedures may be advantageously used. It is preferred to 
purify the protein to >80% purity, more preferably to >90% 
purity, even more preferably >95%, and particularly 

10 preferred is a pharmaceutically pure state, that is greater 
than 99.9% pure with respect to contaminating 
macromolecules, particularly other proteins and nucleic 
acids, and free of infectious and pyrogenic agents. 
Preferably, a purified protein is substantially free of 

15 other proteins, particularly other proteins of animal 
origin. 

ZGCL-1 polypeptides or fragments thereof may also 
be prepared through chemical synthesis. ZGCL-1 
polypeptides may be monomers or multimers; glycosylated or 

20 non-glycosylated; pegylated or non-pegylated; and may or 
may not include an initial methionine amino acid residue. 

An assay system that uses a ligand-binding 
receptor (or an antibody, one member of a complement/ anti- 
complement pair) or a binding fragment thereof, and a 

25 commercially available biosensor instrument (BIAcore™, 
Pharmacia Biosensor, Piscataway, NJ) may be advantageously 
employed. Such receptor, antibody, member of a 

complement/anti-complement pair or fragment is immobilized 
onto the surface of a receptor chip. Use of this 

3 0 instrument is disclosed by Karlsson, J. Immunol. Methods 
145:229-40, 1991 and Cunningham and Wells, J. Mol . Biol. 
234 : 554-63 , 1993. A receptor, antibody, member or fragment 
is covalently attached, using amine or sulfhydryl 
chemistry, to dextran fibers that are attached to gold film 

35 within the flow cell. A test sample is passed through the 
cell. If a ligand, epitope, or opposite member of the 
complement/anti-complement pair is present in the sample, 
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it will bind to the immobilized receptor, antibody or 

member, respectively, causing a change in the refractive 

index of the medium, which is detected as a change in 

surface plasmon resonance of the gold film. This system 

5 allows the determination of on- and off -rates, from which 

binding affinity can be calculated, and assessment of 

stoichiometry of binding. 

In vitro and in vivo response to ZGCL-1 can also 

be measured using cultured cells or by administering 

10 molecules of the claimed invention to the appropriate 
animal model. For instance, ZGCL-1 transfected expression 
host cells may be embedded in an alginate environment and 
injected (implanted) into recipient animals. Alginate- 
poly- L- lysine microencapsulation, permselective membrane 

15 encapsulation and diffusion chambers have been described as 
a means to entrap transfected mammalian cells or primary 
mammalian cells. These types of non- immunogenic 

"encapsulations" or microenvironments permit the transfer 
of nutrients into the microenvironment , and also permit the 

20 diffusion of proteins and other macromolecules secreted or 
released by the captured cells across the environmental 
barrier to the recipient animal. Most importantly, the 
capsules or microenvironments mask and shield the foreign, 
embedded cells from the recipient animal's immune response. 

25 Such microenvironments can extend the life of the injected 
cells from a few hours or days (naked cells) to several 
weeks (embedded cells) . 

Alginate threads provide a simple and quick means 
for generating embedded cells. The materials needed to 

3 0 generate the alginate threads are readily available and 

relatively inexpensive. Once made, the alginate threads 
are relatively strong and durable, both in vitro and, based 

on data obtained using the threads, in vivo. The alginate 

threads are easily manipulable and the methodology is 
3 5 scalable for preparation of numerous threads. In an 
exemplary procedure, 3% alginate is prepared in sterile 
H2O, and sterile filtered. Just prior to preparation of 
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alginate threads, the alginate solution is again filtered. 
An approximately 50% cell suspension (containing about "5 x 
10 5 to about 5 x 10 7 cells/ml) is mixed with the 3% alginate 
solution. One ml of the alginate/cell suspension is 
5 extruded into a 100 mM sterile filtered CaCl 2 solution over 
a time period of ""15 min, forming a "thread" . The extruded 
thread is then transferred into a solution of 50 mM CaCl 2 , 
and then into a solution of 25 mM CaCl 2 . The thread is 
then rinsed with deionized water before coating the thread 

10 by incubating in a 0.01% solution of poly-L- lysine. 
Finally, the thread is rinsed with Lactated Ringer's 
Solution and drawn from solution into a syringe barrel 
(without needle attached) . A large bore needle is then 
attached to the syringe, and the thread is 

15 intraperitoneally injected into a recipient in a minimal 

volume of the Lactated Ringer's Solution. 

An alternative in vivo approach for assaying 

proteins of the present invention involves viral delivery 
systems. Exemplary viruses for this purpose include 

20 adenovirus, herpesvirus, vaccinia virus and adeno- 
associated virus (AAV) . Adenovirus, a double- stranded DNA 
virus, is currently the best studied gene transfer vector 
for delivery of heterologous nucleic acid (for a review, 
see T.C. Becker et al., Meth. Cell Biol. 43.:161-89, 1994; 

25 and J.T. Douglas and D.T. Curiel, Science & Medicine 4:44- 
53, 1997) . The adenovirus system offers several 

advantages: adenovirus can (i) accommodate relatively 
large DNA inserts; (ii) be grown to high-titer; (iii) 
infect a broad range of mammalian cell types; and (iv) be 

3 0 used with a large number of available vectors containing 
different promoters. Also, because adenoviruses are stable 
in the bloodstream, they can be administered by intravenous 
injection. Some disadvantages (especially for gene 

therapy) associated with adenovirus gene delivery include: 

35 (i) very low efficiency integration into the host genome; 
(ii) existence in primarily episomal form; and (iii) the 
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host immune response to the administered virus, precluding 
readministration of the adenoviral vector. 

By deleting portions of the adenovirus genome, 
larger inserts (up to 7 kb) of heterologous DNA can be 
5 accommodated. These inserts may be incorporated into the 
viral DNA by direct ligation or by homologous recombination 
with a co-transf ected plasmid. In an exemplary system, the 
essential El gene has been deleted from the viral vector, 
and the virus will not replicate unless the El gene is 

10 provided by the host cell (i.e., the human 293 cell line). 
When intravenously administered to intact animals, 
adenovirus primarily targets the liver. If the adenoviral 
delivery system has an El gene deletion, the virus cannot 
replicate in the host cells. However, the host's tissue 

15 (i.e., liver) will express and process (and, if a signal 
sequence is present, secrete) the heterologous protein. 
Secreted proteins will enter the circulation in the highly 
vascularized liver, and effects on the infected animal can 
be determined. 

20 The adenovirus system can also be used for 

protein production in vitro. By culturing adenovirus- 

infected non-293 cells under conditions where the cells are 
not rapidly dividing, the cells can produce proteins for 
extended periods of time. For instance, BHK cells are 

25 grown to confluence in cell factories, then exposed to the 
adenoviral vector encoding the secreted protein of 
interest. The cells are then grown under serum- free 
conditions, which allows infected cells to survive for 
several weeks without significant cell division. 

30 Alternatively, adenovirus vector infected 293S cells can be 
grown in suspension culture at relatively high cell density 
to produce significant amounts of protein (see A. Gamier 
et al., Cvtotechnol . 15:145-55, 1994). With either 
protocol, an expressed, secreted heterologous protein can 

35 be repeatedly isolated from the cell culture supernatant. 
Within the infected 2 93S cell production protocol, non- 
secreted proteins may also be effectively obtained. 
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As would be evident to one of ordinary skill in 
the art, polyclonal antibodies can be generated from 
inoculating a variety of warm-blooded animals such as 
horses, cows, goats, sheep, dogs, chickens, rabbits, mice, 
5 hamsters, guinea pigs and rats as well as transgenic 
animals such as transgenic sheep, cows, goats or pigs. 
Antibodies may also be expressed in yeast and fungi in 
modified forms as well as in mammalian and insect cells. 
The ZGCL-1 polypeptide or a fragment thereof serves as an 

10 antigen (immunogen) to inoculate an animal or elicit an 
immune response. Suitable antigens would include the ZGCL- 
1 polypeptide encoded by SEQ ID NO: 2 from amino acid 
residue 1-479 of SEQ ID NO: 2, or a contiguous 9-479 amino 
acid residue fragment thereof. The immunogenicity of a 

15 ZGCL-1 polypeptide may be increased through the use of an 
adjuvant, such as alum (aluminum hydroxide) or Freund's 
complete or incomplete adjuvant. Polypeptides useful for 
immunization also include fusion polypeptides, such as 
fusions of ZGCL-1 or a portion thereof with an 

20 immunoglobulin polypeptide or with an affinity tag. The 
polypeptide immunogen may be a full-length molecule or a 
portion thereof. If the polypeptide portion is "hapten- 
like", such portion may be advantageously joined or linked 
to a macromolecular carrier (such as keyhole limpet 

25 hemocyanin (KLH) , bovine serum albumin (BSA) or tetanus 
toxoid) for immunization. Preferred peptides to use as 
antigens are hydrophilic peptides such as those predicted 
by one of skill in the art from a hydrophobicity plot, see 
for example, Hopp and Woods ( Proc. Nat. Acad. Sci. USA 

30 78:3824-8, 1981) and Kyte and Doolittle ( J. Mol. Biol . 157: 
105-142, 1982) . 

As used herein, the term "antibodies" includes 
polyclonal antibodies , affinity-purified polyclonal 
antibodies, monoclonal antibodies, and antigen-binding 

35 fragments thereof, such as F(ab')2 anc * F ^b proteolytic 
fragments. Genetically engineered intact antibodies or 
fragments, such as chimeric antibodies, Fv fragments, 
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single chain antibodies and the like, as well as synthetic 
antigen-binding peptides and polypeptides, are also 
included. Non-human antibodies may be humanized by 
grafting only non-human CDRs onto human framework and 
5 constant regions, or by incorporating the entire non-human 
variable domains (optionally "cloaking" them with a human- 
like surface by replacement of exposed residues, wherein 
the result is a "veneered" antibody) . In some instances, 
humanized antibodies may retain non-human residues within 
10 the human variable region framework domains to enhance 
proper binding characteristics. Through humanizing 

antibodies, biological half -life may be increased, and the 
potential for adverse immune reactions upon administration 

to humans is reduced. Alternative techniques for 

15 generating or selecting antibodies useful herein include in 

vitro exposure of lymphocytes to ZGCL-1 protein or peptide, 

and selection of antibody display libraries in phage or 
similar vectors (for instance, through use of immobilized 
or labeled ZGCL-1 protein or peptide) . 

20 Antibodies are defined to be specifically binding 

if they bind to a ZGCL-1 polypeptide with a binding 
affinity (K a ) of 10 6 M" 1 or greater, preferably 10 7 M" 1 or 
greater, more preferably 10 8 M" 1 or greater, and most 
preferably 10 9 M" 1 or greater. The binding affinity of an 

25 antibody can be readily determined by one of ordinary skill 
in the art (for example, by Scatchard analysis) . 

Alternative techniques for generating or 
selecting antibodies useful herein include in vitro 
exposure of lymphocytes to ZGCL-1 protein or peptide, and 

30 selection of antibody display libraries in phage or similar 

vectors (for instance, through use of immobilized or 

labeled ZGCL-1 protein or peptide) . 

Genes encoding polypeptides having potential 

ZGCL-1 polypeptide binding domains can be obtained by 

35 screening random peptide libraries displayed on phage 
(phage display) or on bacteria, such as E. coli. 

Nucleotide sequences encoding the polypeptides can be 
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obtained in a number of ways, such as through random 
mutagenesis and random polynucleotide synthesis. These 
random peptide display libraries can be used to screen for 
peptides which interact with a known target which can be a 
5 protein or polypeptide, such as a ligand or receptor, a 
biological or synthetic macromolecule, or organic or 
inorganic substances. Techniques for creating and 

screening such random peptide display libraries are known 
in the art (Ladner et al . , US Patent NO. 5,223,409; Ladner 

10 et al., US Patent NO. 4,946,778; Ladner et al., US Patent 
NO. 5,403,484 and Ladner et al . , US Patent NO. 5,5.71,698) 
and random peptide display libraries and kits for screening 
such libraries are available commercially, for instance 
from Clontech (Palo Alto, CA) , Invitrogen Inc. (San Diego, 

15 CA) , New England Biolabs, Inc. (Beverly, MA) and Pharmacia 
LKB Biotechnology Inc. (Piscataway, NJ) . Random peptide 
display libraries can be screened using the ZGCL-1 
sequences disclosed herein to identify proteins which bind 
to ZGCL-1. These "binding proteins" which interact with 

20 ZGCL-1 polypeptides can be used for tagging cells; for 
isolating homolog polypeptides by affinity purification; 
they can be directly or indirectly conjugated to drugs, 
toxins, radionuclides and the like. These binding proteins 
can also be used in analytical methods such as for 

25 screening expression libraries and neutralizing activity. 
The binding proteins can also be used for diagnostic assays 
for determining circulating levels of polypeptides; for 
detecting or quantitating soluble polypeptides as marker of 
underlying pathology or disease. These binding proteins 

30 can also act as ZGCL-1 "antagonists" to block ZGCL-1 
binding and signal transduction in vitro and in vivo. 
These anti-ZGCL-1 binding proteins would be useful for 
inhibiting ZGCL-1 mediated activity. 

A variety of assays known to those skilled in the 

35 art can be utilized to detect antibodies which specifically 
bind to ZGCL-1 proteins or peptides. Exemplary assays are 
described in detail in Antibodies: A Laboratory Manual ,, 
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Harlow and Lane (Eds.)/ Cold Spring Harbor Laboratory 
Press, 1988- Representative examples of such assays 
include : concurrent Immunoelectrophoresis , 

radioimmunoassay, radioimmuno-precipitation, enzyme- linked 
5 immunosorbent assay (ELISA) , dot blot or Western blot 
assay, inhibition or competition assay, and sandwich assay- 
In addition, antibodies can be screened for binding to 
wild-type versus mutant ZGCL-1 protein or polypeptide. 

Antibodies and binding proteins to ZGCL-1 may be 

10 used for tagging cells that express ZGCL-1; for isolating 
ZGCL-1 by affinity purification; for diagnostic assays for 
determining circulating levels of ZGCL-1 polypeptides; for 
detecting or quantitating soluble ZGCL-1 as marker of 
underlying pathology or disease; in analytical methods 

15 employing FACS; for screening expression libraries; for 
generating anti-idiotypic antibodies; and as neutralizing 
antibodies or as antagonists to block ZGCL-1 mediated 
activity both in vitro and in vivo. Suitable direct tags 
or labels include radionuclides, enzymes, substrates, 

20 cofactors, inhibitors, fluorescent markers, 

chemiluminescent markers, magnetic particles and the like; 
indirect tags or labels may feature use of biotin-avidin or 
other complement/anti-complement pairs as intermediates. 
Antibodies and binding proteins herein may also be directly 

25 or indirectly conjugated to drugs, toxins, radionuclides 
and the like, and these conjugates used for in vivo 
diagnostic or therapeutic applications. Moreover, 
antibodies to ZGCL-1 or fragments thereof may be used in 
vitro to detect denatured ZGCL-1 or fragments thereof in 

30 assays, for example, Western Blots or other assays known in 
the art. 

These antibodies and binding proteins would also 
be useful as contraceptives to prevent the fertilization of 
an egg. Such antibodies would act as antagonists by 
35 inhibiting a component (s) of spermatogenesis and/or sperm 
activation. Such antibody "antagonists" can be used for 
contraception in humans and animals, in particular, 
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domestic animals and livestock. For instance, anti- ZGCL-1 
immunization could be used in place of surgical forms * of 
contraception (such as spaying and neutering) in animals, 
and would allow for the possibility of future breeding of 
5 those animals if desired. 

ZGCL-1 ligand polypeptides may be used to 
identify and characterize genes which bind to ZGCL-1. 
Proteins and peptides of the present invention can be 
immobilized on a column and membrane preparations run over 

10 the column ( Immobilized Affinity Ligand Techniques , 
Hermanson et al . , eds., Academic Press, San Diego, CA, 
1992, 195-202) . Proteins and peptides can also be 

radiolabeled ( Methods in Enzvmol ., vol. 182, "Guide to 
Protein Purification", M. Deutscher, ed., Acad. Press, San 

15 Diego, 1990, 721-37) or photoaf f inity labeled (Brunner et 
al., Ann. Rev. Biochem. 62:483-514, 1993 and Fedan et al . , 
Biochem. Pharmacol . 33.:H67-80, 1984) and specific cell- 
surface proteins can be identified. 

The ZGCL-1 polynucleotides and/or polypeptides 

20 disclosed herein can be useful as therapeutics. 
Polypeptides of the present invention are used to stimulate 
proliferation or differentiation of testicular cells. 
Proliferation and differentiation can be measured using 
cultured testicular cells or in vivo by administering 

25 molecules of the present invention to the appropriate 
animal model. Cultured testicular cells include dolphin 
DBl.Tes cells (CRL-6258) ; mouse GC-1 spg cells (CRL-2053) ; 
TM3 cells (CRL-1714) ; TM4 cells (CRL-1715) ; and pig ST 
cells (CRL-1746) , available from American Type Culture 

30 Collection, 12301 Parklawn Drive, Rockville, MD. 

The initially identified EST was derived from a 

human heart tumor library. ZGCL-1 proteins and polypeptides 

may play a role in the development of cardiovasculature 

tissue, proliferation and differentiation of endothelial 

35 cells and cardiomyocytes . Proliferation can be measured 
using cultured cardiac cells or in vivo by administering 

molecules of the claimed invention to the appropriate 
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animal model. Cultured cells include cardiac fibroblasts, 
cardiac myocytes, skeletal myocytes, human umbilical vein 
endothelial cells from primary cultures. Established cell 
lines include: NIH 3T3 fibroblast (ATCC No. CRL-1658) , CHH- 
5 1 chum heart cells (ATCC No. CRL-1680) , H9c2 rat heart 
myoblasts (ATCC No. CRL-1446) , Shionogi mammary carcinoma 
cells (Tanaka et al., Proc. Natl. Acad. Sci. 89 : 8928-89, 
1992) and LNCap.FGC adenocarcinoma cells (ATCC No. CRL- 
1740. ) 

10 Assays measuring cell proliferation or 

differentiation are well known in the art. For example, 
assays measuring proliferation include such assays as 
chemosensitivity to neutral red dye (Cavanaugh et al., 
Investigational New Drugs 8.:347-54, 1990), incorporation of 

15 radiolabelled nucleotides (Cook et al., Analytical Biochem . 
179:1-7, 1989), incorporation of 5-bromo-2 1 -deoxyuridine 
(BrdU) in the DNA of proliferating cells (Porstmann et al . , 
J. Immunol. Methods 82:169-79, 1985), and use of 
tetrazolium salts (Mosmann, J. Immunol. Methods 65 : 55-63 , 

20 1983; Alley et al . , Cancer Res. 48:589-601, 1988; Marshall 
et al., Growth Reg. 5.: 69-84, 1995; and Scudiero et al . , 
Cancer Res. 48.:4827-33, 1988) . Assays measuring 

differentiation include, for example, measuring cell- 
surface markers associated with stage-specific expression 

25 of a tissue, enzymatic activity, functional activity or 
morphological changes (Watt, FASEB , 5:281-84, 1991; 
Francis, Differentiation 57:63-75, 1994; Raes, Adv. Anim. 
Cell Biol. Technol. Bioprocesses , 161-71, 1989) . 

In vivo assays for evaluating cardiac neogenesis 

30 or hyperplasia include treating neonatal and mature rats 
with the molecules of the present invention. The animals 
cardiac function is measured as heart rate, blood pressure, 
and cardiac output to determine left ventricular function. 
Post-mortem methods for assessing cardiac improvement 

35 include: increased cardiac weight, nuclei/cytoplasmic 
volume, staining of cardiac histology sections to determine 
proliferating cell nuclear antigen (PCNA) vs. cytoplasmic 
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actin levels (Quaini et al . , Circulation Res. 75 : 1050-63 , 
1994 and Reiss et al . , Proc. Natl. Acad. Sci . 93.: 8630"-5, 
1996) . 

In vivo assays for evaluating the effect of ZGCL- 
5 1 polypeptides on testes are also well known in the art. 
For example, compounds can be injected intraperitoneally 
for a specific time duration. After the treatment period, 
animals are sacrificed and testes removed and weighed. 
Testicles are homogenized and sperm head counts are made 

10 (Meistrich et al . , Exp. Cell Res. 99 :72-8, 1976). 

Spermatogenesis is a sequential process and takes 
place in the seminiferous tubules, where germ cells 
ultimately mature into spermatozoa. Testis-specif ic 

factors that influence the maturation process may come 

15 directly from the Sertoli cells that are in contact with 
the sperm cells, or may be paracrine or endocrine factors. 
Many of the molecules produced outside the seminiferous 
tubules are transported into the sperm cell 
microenvironment by transport and binding proteins that are 

20 expressed by the Sertoli cells within the seminiferous 
tubules . 

Paracrine factors that cross the cellular barrier 
and enter the sperm cell microenvironment include molecules 
secreted from Leydig cells. Leydig cells are located in 

25 the interstitial space found between the seminiferous 
tubules, and produce several factors believed to play an 
important role in the maturation process, such as 
testosterone, Leydig factor, IGF-1, inhibin and activin. 
The expression of these, and other factors, may be specific 

3 0 to a defined stage in the spermatogenic cycle. 

The tissue specificity of ZGCL-1 expression 
suggests a role in spermatogenesis, and in view of this 
specificity, agonists and antagonists have enormous 
potential in both in vitro and in vivo applications. ZGCL- 

35 1 proteins and polypeptides and ZGCL-1 agonists are useful 
for stimulating proliferation and development of target 
cells in vitro and in vivo. For example, agonist compounds 
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are useful as components of defined cell culture media, and 
may be used alone or in combination with cytokines and 
hormones to replace serum that is commonly used in cell 
culture. Agonists are thus useful in specifically 

5 promoting the growth and/or development of testis-derived 
cells in culture. Agonist compounds could be used to 
influence cellular differentiation, proliferation or 
development through up-regulation of ZGCL-1 -modulated gene 
expression. Agonists and antagonists may also prove useful 

10 in the study of spermatogenesis and infertility. 
Antagonists are useful as research reagents for 
characterizing sites of DNA binding to the ZGCL-1 
transcription factor. Antagonists are also useful for 
modulating ZGCL-1 mediation transcription. For example, 

15 ZGCL-1 antagonists may be useful as male contraceptive 
agents . 

ZGCL-1 polypeptides and ZGCL-1 agonists would be 
useful therapeutics to treat infertility. Accordingly, 
proteins of the present invention may have applications in 

20 enhancing fertilization during assisted reproduction in 
humans and in animals. Such assisted reproduction methods 
are known in the art and include artificial insemination, 
in vitro fertilization, embryo transfer and gamete 
intrafallopian transfer, for example. Such methods are 

25 useful for assisting men and women who may have 
physiological or metabolic disorders that prevent natural 
conception. Such methods may also be used by women who are 
unable or do not desire to conceive naturally for other 
reasons . Such methods are also used in animal breeding 

3 0 programs, such as for livestock, zoological animal, 
endangered species or racehorse breeding and could be used 
as methods for the creation of transgenic animals. 
Proteins of the present invention can be added to expand 
the number of donor sperm cells prior fertilization of an 

3 5 egg. It is advantageous to increase the number of sperm 
during such procedures to enhance the likelihood of 
successful fertilization. The invention provides methods of 
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enhancing fertilization during assisted reproduction 
wherein a mammalian ZGCL-1 polypeptide is combined with 
sperm prior to fertilization of the egg, or to an egg-sperm 
mixture. Within one embodiment the assisted reproduction 
5 is artificial insemination. Within another embodiment the 
assisted reproduction is in vitro fertilization. 

Polypeptides of the present invention can used to 
enhance viability of cryopreserved sperm, in particular, 
to enhance the number of viable sperm upon thawing. Such 

10 cryopreserved sperm can be used in association with methods 
of assisted reproduction. The invention, provides methods 
of enhancing viability of cryopreserved sperm for use in 
fertilization of an egg, wherein a mammalian ZGCL-1 
polypeptide is added to sperm prior to fertilization, or to 

15 an egg- sperm mixture. 

Fusion of the ZGCL-1 polypeptide and an affinity 
tag (e.g., Glu-Glu affinity tag, FLAG tag, maltose-binding 
protein, an immunoglobulin domain) may be used to select 
sperm at a particular developmental stage for use in in 

20 vitro fertilization procedures. Staging the sperm 

increases the number and type of desired sperm, thereby 
increasing the likelihood of successful fertilization. 

In vivo, ZGCL-1 and ZGCL-1 agonists would find 
application in the treatment of infertility, in particular, 

25 male infertility. It is estimated that 5-6% of men of 
reproductive age are infertile and a predominant cause is 
abnormal sperm count. ZGCL-1 polypeptides and proteins can 
be administered to enhance sperm count. Expression vectors 
containing polynucleotides encoding ZGCL-1 polypeptides or 

3 0 proteins linked with a polynucleotide encoding a testis 
specific receptor can be administered for delivery to and 
expression in testis tissue. ZGCL-1 could also be packaged 
with a testis-specif ic receptor such that uptake of the 
ZGCL-1 polypeptides and proteins occurred only in the 

35 testis. Alternatively, ZGCL-1 proteins or polypeptides 
could be injected directly into the testis. The invention 
provides methods of treating infertility wherein a 
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mammalian ZGCL-1 polypeptide is administered to enhance 
sperm count. In a related embodiment administration is* in 
testicular tissue. 

As used herein antagonists are molecules which 
5 either bind to ZGCL-1 polypeptides or, alternatively, to a 
gene to which ZGCL-1 polypeptides bind, thereby inhibiting 
or eliminating the function of ZGCL-1. Such ZGCL-1 
antagonists would include antibodies; binding proteins; 
oligonucleotides which bind either to the ZGCL-1 

10 polypeptide or to its associated gene(s); natural or 
synthetic analogs of ZGCL-1 polypeptides which retain the 
ability to bind specific genes but do not result in 
transcription. Such analogs could be peptides or peptide- 
like compounds. Natural or synthetic small molecules which 

15 bind to receptors of ZGCL-1 polypeptides and prevent 
transcription are also contemplated as antagonists. 

These ZGCL-1 antagonists are useful agents in 
methods related to fertility and contraception by 
selectively intercepting a process (es) leading to 

20 successful reproduction. As such, ZGCL-1 antagonists would 
be useful for inhibiting spermatogenesis and sperm 
activation. Such ZGCL-1 antagonists can be used for 
contraception in humans and animals, in particular domestic 
animals and livestock, where they ultimately act to prevent 

25 the successful fertilization of an egg. Such antagonists 
could be used, for instance, in place of surgical forms of 
contraception (such as spaying and neutering) , and would 
allow for the possibility of future breeding of treated 
animals, if desired, by discontinuing administration of the 

30 antagonist. ZGCL-1 antagonists may prove useful similar to 
progesterone antagonists. Antiprogestogens, which 

antagonize binding but do not activate, effectively block 
the action of progesterone and are used as contraceptive 
agents. The invention provides methods of contraception 

35 wherein an antagonist of a mammalian ZGCL-1 polypeptide as 
described above is administered to a male recipient to 
prevent fertilization of an egg. Within one embodiment the 
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antagonist is an anti-ZGCL-1 binding protein. Within a 
related embodiment the antagonist is an anti-ZGCL-1 
antibody. 

The invention also provides isolated and purified 
5 ZGCL-1 polynucleotide probes. Such polynucleotide probes 
can be RNA or DNA. DNA can be either cDNA or genomic DNA. 
Polynucleotide probes are single or double -stranded DNA or 
RNA, generally synthetic oligonucleotides, but may be 
generated from cloned cDNA or genomic sequences and will 

10 generally comprise at least 16 nucleotides, more often from 
17 nucleotides to 25 or more nucleotides, sometimes 40 to 
60 nucleotides, and in some instances a substantial 
portion, domain or even the entire ZGCL-1 gene or cDNA. 
The synthetic oligonucleotides of the present invention 

15 have at least 80% identity to a representative ZGCL-1 DNA 
sequence (SEQ ID NO:l) or its complements. Preferred 
regions from which to construct probes include the 5' 
and/or 3' coding sequences, DNA binding domains, affinity 
domains, signal sequences and the like. Techniques for 

20 developing polynucleotide probes and hybridization 
techniques are known in the art, see for example, Ausubel 
et al., eds., Current Protocols in Molecular Biology , John 
Wiley and Sons, Inc., NY, 1991. For use as probes, the 
molecules can be labeled to provide a detectable signal, 

25 such as with an enzyme, biotin, a radionuclide, 
fluorophore, chemiluminescer , paramagnetic particle and the 
like, which are commercially available from many sources, 
such as Molecular Probes, Inc., Eugene, OR, and Amersham 
Corp., Arlington Heights, IL, using techniques that are 

30 well known in the art. 

Such probes can also be used in hybridizations to 

detect the presence or quantify the amount of ZGCL-1 gene 

or mRNA transcript in a sample. ZGCL-1 polynucleotide 

probes could be used to hybridize to DNA or RNA targets for 

3 5 diagnostic purposes, using such techniques such as 
fluorescent in situ hybridization (FISH) or 

immunohistochemistry . Polynucleotide probes could be used 
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to identify genes encoding ZGCL-l-like proteins. For 
example, ZGCL-1 polynucleotides can be used as primers 
and/or templates in PCR reactions to identify other novel 
transcription factors. Such probes can also be used to 
5 screen libraries for related sequences encoding novel 
transcription factors. Such screening would be carried out 
under conditions of low stringency which would allow 
identification of sequences which are substantially 
homologous, but not requiring complete homology to the 

10 probe sequence. Such methods and conditions are well known 
in the art, see, for example, Sambrook et al . , Molecular 
Cloning: A Laboratory Manual , Second Edition, Cold Spring 
Harbor, NY, 1989. Such low stringency conditions could 
include hybridization temperatures less than 42°C, 

15 formamide concentrations of less than 50% and moderate to 

low concentrations of salt. Libraries may be made of 

genomic DNA or cDNA. Polynucleotide probes are also useful 

for Southern, Northern, or slot blots, colony and plaque 
hybridization and in situ hybridization. Mixtures of 

20 different ZGCL-1 polynucleotide probes can be prepared 
which would increase sensitivity or the detection of low 
copy number targets, in screening systems. 

ZGCL-1 polypeptides may be used within diagnostic 
systems. Antibodies or other agents that specifically bind 

25 to ZGCL-1 may be used to detect the presence of circulating 
polypeptides. Such detection methods are well known in the 
art and include, for example, enzyme -linked immunosorbent 
assay (ELISA) and radioimmunoassay. Immunohistochemically 
labeled antibodies can be used to detect ZGCL-1 in 

30 biological samples. ZGCL-1 levels can also be monitored by 
such methods as RT-PCR, where ZGCL-1 mRNA can be detected 
and quantified. Such methods could be used as diagnostic 
tools to monitor and quantify polypeptide levels. The 
information derived from such detection methods would 

3 5 provide insight into the significance of ZGCL-1 
polypeptides in various conditions or diseases, and as a 
would serve as diagnostic methods for conditions or 
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diseases for which altered levels of ZGCL-1 are 
significant. Altered levels of ZGCL-1 ligand polypeptides 
may be indicative of pathological conditions including 
cancer and cardiac and reproductive disorders. 
5 The present invention also provides reagents 

which will find use in diagnostic applications. For 
example, the ZGCL-1 gene, a probe comprising ZGCL-1 DNA or 
RNA or a subsequence thereof can be used to determine if 
the ZGCL-1 gene is present on chromosome 5 or if a mutation 

10 has occurred. Detectable chromosomal aberrations at the 
ZGCL-1 gene locus include but are not limited to 
aneuploidy, gene copy number changes, insertions, 
deletions, restriction site changes and rearrangements. 

In general, these diagnostic methods comprise the 

15 steps of (a) obtaining a genetic sample from a patient; (b) 
incubating the genetic sample with a polynucleotide probe 
or primer as disclosed above, under conditions wherein the 
polynucleotide will hybridize to complementary 
polynucleotide sequence, to produce a first reaction 

20 product; and (iii) comparing the first reaction product to 
a control reaction product. A difference between the first 
reaction product and the control reaction product is 
indicative of a genetic abnormality in the patient. 
Genetic samples for use within the present invention 

25 include genomic DNA, cDNA, and RNA. The polynucleotide 
probe or primer can be RNA or DNA, and will comprise a 
portion of SEQ ID NO:l, the complement of SEQ ID NO:l, or 
an RNA equivalent thereof . Suitable assay methods in this 
regard include molecular genetic techniques known to those 

30 in the art, such as restriction fragment length 

polymorphism (RFLP) analysis, short tandem repeat (STR) 

analysis employing PCR techniques, ligation chain reaction 

(Barany, PCR Methods and Applications 1:5-16, 1991), 

ribonuclease protection assays, and other genetic linkage 

35 analysis techniques known in the art (Sambrook et al., 
ibid.; Ausubel et . al . , ibid . ; Marian, Chest 108 :255-65. 

1995). Ribonuclease protection assays (see, e.g., Ausubel 
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et al . , ibid . . ch. 4) comprise the hybridization of an RNA 
probe to a patient RNA sample, after which the reaction 
product (RNA- RNA hybrid) is exposed to RNase . Hybridized 
regions of the RNA are protected from digestion. Within 
5 PCR assays, a patient's genetic sample is incubated with a 
pair of polynucleotide primers, and the region between the 
primers is amplified and recovered. Changes in size or 
amount of recovered product are indicative of mutations in 
the patient. Another PCR-based technique that can be 

10 employed is single strand conformational polymorphism 
(SSCP) analysis (Hayashi, PCR Methods and Applications 
1:34-8, 1991) . 

The invention also provides nucleic acid-based 
therapeutic treatment. If a mammal has a mutated or lacks 

15 a ZGCL-1 gene, the ZGCL-1 gene can be introduced into the 

cells of the mammal. In one embodiment, a gene encoding a 
ZGCL-1 polypeptide is introduced in vivo in a viral vector. 

Such vectors include an attenuated or defective DNA virus, 
such as but not limited to herpes simplex virus (HSV) , 

20 papillomavirus, Epstein Barr virus (EBV) , adenovirus, 
adeno-associated virus (AAV), and the like. Defective 
viruses, which entirely or almost entirely lack viral 
genes, are preferred. A defective virus is not infective 
after introduction into a cell. Use of defective viral 

25 vectors allows for administration to cells in a specific, 
localized area, without concern that the vector can infect 
other cells. Examples of particular vectors include, but 
are not limited to, a defective herpes virus 1 (HSV1) 
vector (Kaplitt et al . , Molec. Cell. Neurosci . 2:320-30, 

30 1991) , an attenuated adenovirus vector, such as the vector 

described by Stratf ord-Perricaudet et al. ( J. Clin. Invest . 

90:626-30, 1992), and a defective adeno-associated virus 

vector (Samulski et al . , J. Virol . 61:3096-101, 1987; 
Samulski et al., J. Virol . 63:3822-28, 1989). 

35 In another embodiment, the gene can be introduced 

in a retroviral vector, e.g., as described in Anderson et 

al., U.S. Patent No. 5,399,346; Mann et al . , Cell 31:153, 
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1983; Ternin et al . , U.S. Patent No. 4,650,764; Temin et 
al., U.S. Patent No. 4,980,289; Markowitz et al . , J. Vifbl . 

62:1120, 1988; Temin et al . , U.S. Patent No. 5,124,263; 

Dougherty et al . , WIPO Publication WO 95/07358; and Kuo et 

5 al., Blood 82:845-52, 1993. 

Alternatively, the vector can be introduced by 
lipofection in vivo using liposomes. Synthetic cationic 

lipids can be used to prepare liposomes for in vivo 

transfection of a gene encoding a marker (Feigner et al., 

10 Proc. Natl. Acad. Sci. USA £4:7413-17, 1987; and Mackey et 

al., Proc. Natl. Acad. Sci. USA 85:8027-31, 1988). The use 

of lipofection to introduce exogenous genes into specific 
organs in vivo has certain practical advantages. Molecular 

targeting of liposomes to specific cells represents one 

15 area of benefit. It is clear that directing transfection 
to particular cells represents one area of benefit. It is 
clear that directing transfection to particular cell types 
would be particularly advantageous in a tissue with 
cellular heterogeneity, such as the pancreas, liver, 

2 0 kidney, and brain. Lipids may be chemically coupled to 
other molecules for the purpose of targeting. Targeted 
peptides, e.g., hormones or neurotransmitters, and proteins 
such as antibodies, or non-peptide molecules could be 
coupled to liposomes chemically. 

25 It is possible to remove the cells from the body 

and introduce the vector as a naked DNA plasmid and then 
re -implant the transformed cells into the body. Naked DNA 
vector for gene therapy can be introduced into the desired 
host cells by methods known in the art, e.g., transfection, 

30 electroporation, microinjection, transduction, cell fusion, 
DEAE dextran, calcium phosphate precipitation, use of a 

gene gun or use of a DNA vector transporter (see, for 
example, Wu et al . , J. Biol. Chem . 267 :963-7 , 1992; Wu et 
al., J. Biol. Chem . 263:14621-24, 1988). 

35 The ZGCL-1 polypeptides are also contemplated for 

pharmaceutical use. Pharmaceutically effective amounts of 
ZGCL-1 polypeptides, agonists or ZGCL-1 antagonists of the 
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present invention can be formulated with pharmaceutically 
acceptable carriers for parenteral, oral, nasal, rectal, 
topical, transdermal administration or the like, according 
to conventional methods. Formulations may further include 
5 one or more diluents, fillers, emulsifiers, preservatives, 
buffers, excipients, and the like, and may be provided in 
such forms as liquids, powders, emulsions, suppositories, 
liposomes, transdermal patches and tablets, for example. 
Slow or extended-release delivery systems, including any of 

10 a number of biopolymers (biological -based systems) , systems 
employing liposomes, and polymeric delivery systems, can 
also be utilized with the compositions described herein to 
provide a continuous or long-term source of the ZGCL-1 
polypeptide or antagonist. Such slow release systems are 

15 applicable to formulations, for example, for oral, topical 
and parenteral use. The term "pharmaceutically acceptable 
carrier" refers to a carrier medium which does not 
interfere with the effectiveness of the biological activity 
of the active ingredients and which is not toxic to the 

20 host or patient. One skilled in the art may formulate the 
compounds of the present invention in an appropriate 
manner, and in accordance with accepted practices, such as 
those disclosed in Remington 1 s Pharmaceutical Sciences , 
Gennaro (ed. ) , Mack Publishing Co., Easton, PA 1990. 

25 As used herein a "pharmaceutically effective 

amount" of a ZGCL-1 polypeptide, agonist or antagonist is 
an amount sufficient to induce a desired biological result. 
The result can be alleviation of the signs, symptoms, or 
causes of a disease, or any other desired alteration of a 

3 0 biological system. For example, an effective amount of a 
ZGCL-1 polypeptide is that which provides either subjective 
relief of symptoms or an objectively identifiable 
improvement as noted by the clinician or other qualified 
observer. For example, such an effective amount of a ZGCL- 

35 1 polypeptide results in an increase in sperm count. 
Effective amounts of the ZGCL-1 polypeptides can vary 
widely depending on the disease or symptom to be treated. 
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The amount of the polypeptide to be administered and its 

concentration in the formulations, depends upon the vehicle 

selected, route of administration, the potency of the 

particular polypeptide, the clinical condition of the 

5 patient, the side effects and the stability of the compound 

in the formulation. Thus, the clinician will employ the 

appropriate preparation containing the appropriate 

concentration in the formulation, as well as the amount of 

formulation administered, depending upon clinical 

10 experience with the patient in question or with similar 

patients. Such amounts will depend, in part, on the 

particular condition to be treated, age, weight, and 

general health of the patient, and other factors evident to 

those skilled in the art. Typically a dose will be in the 

15 range of 0.1-100 mg/kg of subject. Doses for specific 
compounds may be determined from in vitro or ex vivo 

studies in combination with studies on experimental 

animals. Concentrations of compounds found to be effective 

in vitro or ex vivo provide guidance for animal studies, 

2 0 wherein doses are calculated to provide similar 

concentrations at the site of action. 

The dosages of the present compounds used to 
practice the invention include dosages effective to result 
in the desired effects. Estimation of appropriate dosages 
25 effective for the individual patient is well within the 
skill of the ordinary prescribing physician ' or other 
appropriate health care practitioner. As a guide, the 
clinician can use conventionally available advice from a 
source such as the Physician's Desk Reference, 48 th Edition, 

3 0 Medical Economics Data Production Co., Montvale, New Jersey 

07645-1742 (1994) . 

Preferably the compositions are presented for 
administration in unit dosage forms. The term "unit dosage 
form" refers to physically discrete units suitable as 
35 unitary dosed for human subjects and animals, each unit 
containing a predetermined quantity of active material 
calculated to produce a desired pharmaceutical effect in 
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association with the required pharmaceutical diluent, 
carrier or vehicle. Examples of unit dosage forms include 
vials, ampules, tablets, caplets, pills, powders, granules, 
eyedrops, oral or ocular solutions or suspensions, ocular 
5 ointments, and oil-in-water emulsions. Means of 

preparation, formulation and administration are known to 
those of skill, see generally Remington: The Science and 
Prac t ice of Pharmacy , Gennaro, ed., Mack Publishing Co., 
Easton, PA, 19th ed. , 1995. 
10 The invention is further illustrated by the 

following non- limiting examples. 

EXAMPLES 

15 Example 1 

Identification of ZGCL-1 

Novel ZGCL-1 ligand-encoding polynucleotides and 
polypeptides of the present invention were initially 

20 identified by querying an EST database. An initial EST 
from a human heart tumor library was found. A second EST, 
from fetal mouse, was identified which was 87% identical at 
the nucleotide level to the human EST. Using this 
information, oligonucleotide primers ZC12991 (SEQ ID NO: 5) 

25 and ZC12992 (SEQ ID NO: 6) were made to the EST sequence and 
a pooled human testis library was screened using PCR. 
Eighty reactions were set up, each consisted of 2.5 fil 10X 
PCR reaction buffer (Boehringer Mannheim, Indianapolis, 
IN), 2 fil dNTPs mix (2.5 mM each, Perkin-Elmer , Foster 

30 City, CA) , 0.5 fil sense primer, ZC 12,992 (20 pmol/|il) , 0.5 
fil antisense primer, ZC 12,991 (20 pmol/nl) , 2.5 fil 
JRediLoad (Research Genetics, Inc., Huntsville, AL) , 0.5 fil 

AmpliTaq™ (Perkin-Elmer Cetus, Norwalk, Ct . ) about 2-4 ng 
of DNA from an individual testis library pool and ddH 2 0 for 
35 a total volume of 25 /xl. The reactions were overlaid with 
an equal amount of mineral oil and sealed. The PCR cycler 
conditions were as follows: an initial 35 cycles of a 20 
second denaturation at 94°C, 30 second annealing at 55°C and 



WO 99/09168 



PCT/US98/17243 



61 



3 0 second extension at 72°C, followed by a final 1 cycle 
extension of 10 minutes at 72°C. PCR products were 
analyzed on a 1 % agarose gel and three pools giving the 
expected 200 bp PCR product were replated on to LB plates. 
5 Colonies were screened by PCR using gene-specific primers. 

Four individual colonies were picked from each 
pool and added to microcentrifuge tubes by swirling the 
toothpick with the colony on it in a tube containing 18.5 
1 H2O, 2.5 |il lOx Taq polymerase buffer (Boehringer 

10 Mannheim, Indianapolis, IN) , 2 |Lil 10 mM dNTPs (Perkin 
Elmer), 0.75 \il ZC12991 (SEQ ID NO: 5) (20 pmol/jil) , 0.75 ml 
ZC12992 (SEQ ID NO: 6) (20 pmol/jil), and 0.5 [il Taq 
polymerase. Amplification reactions were incubated at 94°C 
for 1 minute to lyse the bacteria and expose the plasmid 

15 DNA, then run for 25 cycles of 94°C, 20 seconds; 55°C, 30 
seconds; 72°C, 30 seconds to amplify cloned inserts, 
followed by a 10 minute extension at 72°C. Products were 
analyzed by electrophoresis on a 1% agarose gel. Clones 
giving a single 200 bp product were identified as positive, 

20 and the sequence confirmed by sequence analysis. 

One clone, 83.1.3 was used to identify the 
corresponding cDNA. The clone was amplified using a 
QIAwell 8 plasmid kit (Qiagen, Inc., Chatsworth, CA) 
according to manufacturer's instructions, a 5 ml overnight 

25 culture in LB + 50 mg/ml ampicillin was prepared. The 
template was sequenced on an Applied Biosystems™ model 373 
DNA sequencer (Perkin-Elmer Cetus, Norwalk, Ct . ) using the 
ABI PRISM™ Dye Terminator Cycle Sequencing Ready Reaction 
Kit (Perkin-Elmer Corp.) according the manufacturer's 

30 instructions. Oligonucleotides ZC694 (SEQ ID NO:7), 2C2681 
(SEQ ID NO:8), ZC12991 (SEQ ID NO:5), 2C12992 (SEQ ID 
N0:6), ZC14122 (SEQ ID NO:9), ZC14183 (SEQ ID NO:10), 
ZC14184 (SEQ ID NO:ll), ZC14237 (SEQ ID NO:12), ZC14238 
(SEQ ID NO:13), ZC14284 (SEQ ID NO:14) and ZC14345 (SEQ ID 

3 5 NO: 15) were used to sequence from the clone. Sequencing 
reactions were carried out in a Hybaid OmniGene Temperature 
Cycling System (National Labnet Co., Woodbridge, NY). 
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Sequencher™ 3.1 sequence analysis software (Gene Codes 
Corporation, Ann Arbor, MI) was used for data analysis. The 
resulting 1,467 bp sequence is disclosed in SEQ ID NO:l 
which contained the initially identified EST sequence. 

5 

Example 2 
Tissue Distribution 

Human Multiple Tissue Northern Blots (MTN I, MTN 

10 II, and MTN III; Clontech) were probed with an 
approximately 200 bp (SEQ ID NO: 16) PCR derived probe 
containing the sequence of the EST . The probe was 
amplified from a human heart Marathon™- ready cDNA library 
using oligonucleotide primers ZC12991 (SEQ ID NO: 5) and 

15 ZC12992 (SEQ ID NO:6). The Marathon™- ready cDNA library 
was prepared according to manufacturer's instructions 
(Marathon™ cDNA Amplification Kit; Clontech, Palo Alto, CA) 
using human heart poly A + RNA (Clontech) . The probe was 
amplified in a polymerase chain reaction as follows: 35 

20 cycles of 94°C for 20 seconds, 55°C for 30 seconds, and 72°C 
for 30 seconds, followed by 1 cycle at 72°C for 10 minutes. 
The resulting DNA fragment was electrophoresed on a 2% GTG 
agarose gel, the fragment was purified using the QIAquickTM 
method (Qiagen, Chatsworth, CA) , and the sequence was 

25 confirmed by sequence analysis. The probe was 

radioactively labeled using the random priming MULTIPRIME 
DNA labeling system (Amersham, Arlington Heights, IL) , 
according to the manufacturer's specifications. The probe 
was purified using a NUCTRAP push column (Stratagene, La 

30 Jolla, CA) . ExpressHyb™ (Clontech) solution was used for 
prehybridization and as a hybridizing solution for the 
Northern blots. Hybridization took place overnight at 65°C 
using 1 x 10 6 cpm/ml of labeled probe. The blots were then 
washed at 50°C in 0 . IX SSC, 0.1% SDS. A predominant 

35 transcript of 3.2 kb was detected in testis. Reduced 
expression was seen in thyroid, spinal cord, stomach, lymph 



WO 99/09168 



PCT/US98/17243 



63 

node and trachea. A weak transcript was seen in placenta 
and pancreas at 4.5 kb. 

Example 3 

5 Chromosomal Assignment and Placement of ZGCL-1 

ZGCL-1 was mapped to chromosome 5 using the 
commercially available version of the Whitehead 
Institute/MIT Center for Genome Research* s GeneBridge 4 

10 Radiation Hybrid Panel (Research Genetics, Inc., 
Huntsville, AL) . The GeneBridge 4 Radiation Hybrid Panel 
contains PCRable DNAs from each of 93 radiation hybrid 
clones, plus two control DNAs (the HFL donor and the A23 
recipient). A publicly available WWW server (http://www- 

15 genome .wi .mit . edu/cgi-bin/contig/rhmapper .pi) allows 
mapping relative to the Whitehead Institute/MIT Center for 
Genome Research's radiation hybrid map of the human genome 
(the "WICGR" radiation hybrid map) which was constructed 
with the GeneBridge 4 Radiation Hybrid Panel. 

20 For the mapping of ZGCL-1 with the GeneBridge 4 

RH Panel, 25 fil reactions were set up in a PCRable 96 -well 
microtiter plate (Stratagene, La Jolla, CA) and used in a 
RoboCycler Gradient 96 thermal cycler (Stratagene) . Each of 
the 95 PCR reactions consisted of 2.5 /xl 10X KlenTaq 

25 reaction buffer (Clontech) , 2 /xl dNTPs mix (2.5 mM each, 

Perkin-Elmer) , 1.25 pi sense primer, ZC 12992 (SEQ ID 

NO:6), 1.25 \il antisense primer, ZC 12991 (SEQ ID NO:5), 
2.5 pi RediLoad (Research Genetics), 0.5 pi 50X Advantage 

KlenTaq Polymerase Mix (Clontech) , 2 5 ng of DNA from an 
3 0 individual hybrid clone or control and ddH 2 0 for a total 
volume of 25 jxl. The reactions were overlaid with an equal 
amount of mineral oil and sealed. The PCR cycler conditions 
were as follows: an initial 1 cycle 5 minute denaturation 
at 95°C, 35 cycles of a 1 minute denaturation at 95°C, 1 
35 minute annealing at 58°C and 1 minute extension at 72°C, 
followed by a final 1 cycle extension of 7 minutes at 72°C. 



WO 99/09168 



PCT7US98/17243 



64 

The reactions were separated by electrophoresis on a 3% 
NuSieve GTG agarose gel (FMC Bioproducts, Rockland, ME) . 

The results showed that ZGCL-1 maps 10.09 cR from 
the framework marker WI-6737 on the chromosome 5 WICCR 
5 radiation hybrid map. Relative to the centromere, its 
nearest proximal marker was WI-4897 and its nearest distal 
maker was WI-14295. The use of surrounding markers position 
ZGCL-1 in the 5q3 5.3 region on the integrated LDB 
chromosome 5 map (The Genetic Location Database, University 

10 of Southampton, WWW server :http: //cedar . 

genetics . soton . ac ,uk/public_html/) . 

Genes mapping to human chromosome 5q34-q35 have 
been primarily localized to mouse chromosome 11, in the A5- 
Bl and Al-Bl region. The mouse "germ cell deficient" gene, 

15 GCD, locus maps to chromosome 11 in the A2-A3 region 
(Duncan et al., Mamm . Genome 6.:697-9, 1995). Germ cell 
deficient leads to improper migration and/or proliferation 
of primordial germ cells during embryonic development 
resulting in infertility in the adult mouse. Mice having 

20 this mutation have been hypothesized to be animal models 
for the human reproductive disorders, premature ovarian 
failure and Sertoli cell only syndrome. 

From the foregoing, it will be appreciated that, 
although specific embodiments of the invention have been 

25 described herein for purposes of illustration, various 
modifications may be made without deviating from the spirit 
and scope of the invention. Accordingly, the invention is 
not limited except as by the appended claims. 
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CLAIMS 

What is claimed is : 

1. An isolated polypeptide comprising a sequence 
of amino acid residues that is at least 80% identical in amino 
acid sequence to residues 1-479 of SEQ ID NO: 2, wherein said 
sequence comprises a POZ domain corresponding to amino acid 
residues 61-178 of SEQ ID NO: 2. 

2. An isolated polypeptide according to claim 2, 
wherein said polypeptide is at least 90% identical in amino 
acid sequence to residues 1-479 of SEQ ID NO: 2, wherein said 
sequence comprises a POZ domain corresponding to amino acid 
residues 61-178 of SEQ ID NO: 2. 

3. An isolated polypeptide according to claim 1, 
covalently linked amino terminally or carboxy terminally to a 
moiety selected from the group consisting of affinity tags, 
toxins, radionucleotides, enzymes and f luorophores . 

4. A isolated polynucleotide encoding a polypeptide 
comprising a sequence of amino acid residues that is at least 
80% identical in amino acid sequence to residues 1-479 of SEQ 
ID NO: 2, wherein said sequence comprises a POZ domain 
corresponding to amino acid residues 61-178 of SEQ ID NO: 2. 

5. An isolated polynucleotide according to claim 
4, wherein said polypeptide is at least 90% identical in amino 
acid sequence to residues 1-479 of SEQ ID NO: 2, wherein said 
sequence comprises a POZ domain corresponding to amino acid 
residues 61-178 of SEQ ID NO: 2. 

6 . An isolated polynucleotide comprising the 
sequence of nucleotide 1 to nucleotide 1437 of SEQ ID NO:4. 
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7. An oligonucleotide probe or primer comprising 
at least 14 contiguous nucleotides of a polynucleotide of SEQ 
ID NO: 4 or a sequence complementary to SEQ ID NO:4. 

8. An expression vector comprising the following 
operably linked elements: 

a transcription promoter; 

a DNA segment encoding a polypeptide comprising a 
sequence of amino acid residues that is at least 80% identical 
in amino acid sequence to residues 1-479 of SEQ ID NO: 2, 
wherein said sequence comprises a POZ domain corresponding to 
amino acid residues 61-178 of SEQ ID NO: 2; and 

a transcription terminator. 

9. An expression vector according to claim 8, 
wherein said DNA segment encodes a polypeptide that is at 
least 90% identical in amino acid sequence to residues 1-479 
of SEQ ID NO: 2, wherein said sequence comprises a POZ domain 
corresponding to amino acid residues 61-178 of SEQ ID NO: 2. 

10. An expression vector according to claim 8, 
wherein said DNA segment encodes a polypeptide covalently 
linked amino terminally or carboxy terminally to an affinity 
tag. 

11. An expression vector according to claim 8, 
wherein said DNA segment further encodes a secretory signal 
sequence operably linked to said polypeptide. 

12. A cultured cell into which has been introduced 
an expression vector comprising the following operably linked 
elements : 

a transcription promoter; 

a DNA segment encoding a polypeptide comprising a 
sequence of amino acid residues that is at least 80% identical 
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in amino acid sequence to residues 1-479 of SEQ ID NO; 2, 
wherein said sequence comprises a POZ domain corresponding to 
amino acid residues 61-178 of SEQ ID NO: 2; and 

a transcription terminator; 

wherein said cell expresses said polypeptide encoded 
by said DNA segment . 

13. A method of producing a polypeptide comprising: 
culturing a cell into which has been introduced an 

expression vector comprising the following operably linked 
elements : 

a transcription promoter; 

a DNA segment encoding a polypeptide comprising a 
sequence of amino acid residues that is at least 80% identical 
in amino acid sequence to residues 1-479 of SEQ ID NO: 2, 
wherein said sequence comprises a POZ domain corresponding to 
amino acid residues 61-178 of SEQ ID NO: 2; and 

a transcription terminator; 

whereby said cell expresses said polypeptide encoded 
by said DNA segment; and 

recovering said expressed polypeptide. 

14. A pharmaceutical composition comprising a 
polypeptide, said polypeptide comprising a sequence of amino 
acid residues that is at least 80% identical in amino acid 
sequence to residues 1-479 of SEQ ID NO: 2, wherein said 
sequence comprises a POZ domain corresponding to amino acid 
residues 61-178 of SEQ ID NO: 2; 

in combination with a pharmaceutically acceptable 

vehicle . 

15. An antibody that specifically binds to an 
epitope of a polypeptide comprising a sequence of amino acid 
residues that is at least 80% identical in amino acid sequence 
to residues 1-479 of SEQ ID NO: 2, wherein said sequence 
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comprises a POZ domain corresponding to amino acid residues 
61-178 of SEQ ID NO:2. 

16. A binding protein that specifically binds to 
an epitope of a polypeptide comprising a sequence of amino 
acid residues that is at least 80% identical in amino acid 
sequence to residues 1-479 of SEQ ID NO: 2, wherein said 
sequence comprises a POZ domain corresponding to amino acid 
residues 61-178 of SEQ ID NO: 2. 

17. A method for detecting a genetic abnormality in 
a patient, comprising: 

obtaining a genetic sample from a patient; 

incubating the genetic sample with a polynucleotide 
comprising at least 14 contiguous nucleotides of SEQ ID NO:l 
or the complement of SEQ ID NO:l, under conditions wherein 
said polynucleotide will hybridize to complementary 
polynucleotide sequence, to produce a first reaction product; 

comparing said first reaction product to a control 
reaction product, wherein a difference between said first 
reaction product and said control reaction product is 
indicative of a genetic abnormality in the patient. 
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10 20 30 40 50 60 - 

zgcll MRRATDSVTVRGSHKRKRSSGSFCYCHPDSETDEDEEEGDEQQRLLNTPRRKKLKSTSKY 

»« • • I a • a * ■ 

• •*» * • » a «•••*» • a a ■ ■ 

GCL1D MGQIVGSMHMNVAEVFSNRRKRKRSTDSSLGKDDPAQLDTTQPKKKKLLTTTQY 

10 20 30 40 . 50 



70 80 90 100 110 120 

zgcll I YQTLFLNGENSDIK I CALGEEWSLHKI YLCQSGYFSSMFSGSWKESSMN I I ELE I PDQN 

■ f • « a a I * a •••■*«*••• a a a a a a a • * 

GCL1D IYKALFKEEKNSDVAVMALDKVWHLHKVYLSQSPYFYTMFNGTWREAQQNFIQITILDDR 
60 70 80 90 100 110 



130 140 150 160 170 180 

zgcll IDVEALQVAFGSLYRDDVLIKPSRVVAILAAACLLQLDGLIQQCGETMKETVNVKTVCGY 

t # • ■ a a a a * ■ • • • • »••* a a a ■ » 

GCL1D ITVAS LDAVFGSMYSDE I E I ESADV I SVLATATLFHLDG 1 1 DKCAE VMVDN I SPETAI QY 
120 130 140 150 160 170 



190 200 210 220 230 

zgcll YTSAGTYGLDSVKKKCLEWLLNNLMT--HQNVELFKELSINVMKQLIGSSNLFVMQVEMD 

P > a • ■ a a • a i » • * * • ■ a • a 

» • » a a t a ■ « • ■ a a a a a a •■••••«»*«•* • ■• 

GCL1D YEAACQYGVVGVKKSTFQWFQINLLSIYSKQPNLLRHISIELMSALTASPDLYVMQTEFS 
180 190 200 210 220 230 



240 250 260 270 
zgcl 1 IYTALKKWMFLQLVPSWNGS LKQLLTETDVWFSKQRKDF E 

• «•«•••»* ■ » • 

a*a •••••»»»•**»* aaaaaaa* a * 

GCL1D LYTLLRTWMFLRLHPDYDPEDPVQRAEALKTQELLVNAGVETHAPSGDVVQWTYFTSRSE 
240 250 260 270 280 290 



280 290 300 310 320 330 

zgcll GMAFLETEQGKPFVSVFRHLRLQYIISDLASARIIEQDAVVPSEWLSSVYKQQWFAMLRA 

a a a «••*• * * a « a a • a « t * a a a a 

a a a • a a » * * * a a a a a a a » * * • • • a a • • a a a 

GCL1D ERSFLATPEGQPYVKVFQKLRTQYLTNHYMDLKIIYNDNIIPKEWLYRHIHNHWDALLRI 
300 310 320 330 340 350 
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340 350 360 370 380 390 

zgcll EQDSE-VGPQEINKEELEGNSMRCGRKLAKDGEYCWRWTGFNFGFDLLVTYTNRYIIFKR 

• • • * ■ ■•*■•• 4 • ••■■■■•*«■ * » 

■ B B * ■ »«*«« * * * * • * ft ft ft • • « • ■■*■■»••«••«• * • • ■ 

GCL1D DHGQEDCSPQQLDDEQFFENCMRCGRMLLEPGYQKWRWTGFNFGMDL I L IMDSRRLN I RR 
360 370 380 390 400 410 



400 410 420 430 440 450 

zgcll NTLNQPCSGSVSLQPRRSIAFRLRLASFDSSGKLICSRTTGYQILTLEKDQEQVVMNLDS 

* * * ■ * ♦ « • ■ * * * • ■ • 

• •» • • • • • # • * t * • * • ■ • « A • • » ■ « • » t ■ * 

GCL1D HHRHEH-ERVLSLQTKRKFMVRTTVTSINAQRQAVFTQTSEICSLSLEKNEEVPLMVLDP 
420 430 440 450 460 470 



460 470 
zgcll RLLIFPLYICCNFLYISPEKKN 



GCL1D KL-VHPLLISINMLVVMPPNQSFKEIVPLSEEATTSLSIPISEIGANSDRPLSPSSADDS 

480 490 500 510 520 530 



GCL1D AVFIGDSEPSTP 

540 



Fig. lb 
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SEQUENCE LISTING 

<110> ZymoGeneti cs . Inc. 

1201 Eastlake Avenue East 
Seattle. Washington 98102 
United States of America 

<120> TESTIS -SPECIFIC TRANSCRIPTION FACTOR 
ZGCL-1 

<130> 97-18PC 

<150> 06/056,130 
<151> 1997-08-19 

<160> 16 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 1469 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 

<222> (34). . .(1469) 
<400> 1 

ggggctcggc ccggaggccg gacactggag acg atg egg egg gec acg gat tct 54 

Met Arg Arg Ala Thr Asp Ser 
1 5 

gtt act gtg egg ggc age cac aag cgc aag egg age age ggg tec ttc 102 
Val Thr Val Arg Gly Ser His Lys Arg Lys Arg Ser Ser Gly Ser Phe 
10 . 15 20 

tgc tac tgt cac cct gac teg gag acg gac gag gat gag gag gag ggg 150 
Cys Tyr Cys His Pro Asp Ser Glu Thr Asp Glu Asp Glu Glu Glu Gly 
25 30 35 



gac gag cag cag egg etc etc aac acc cct cga agg aaa aaa tta aag 
Asp Glu Gin Gin Arg Leu Leu Asn Thr Pro Arg Arg Lys Lys Leu Lys 



198 
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40 45 50 55 

agt aca tct aaa tat att tat caa aca tta ttt ttg aat ggt gaa aac 246 
Ser Thr Ser Lys Tyr He Tyr Gin Thr Leu Phe Leu Asn Gly Glu Asn 

60 65 70 

agt gac att aag att tgt get eta gga gaa gaa tgg age tta cac aaa 294 
Ser Asp He Lys He Cys Ala Leu Gly Glu Glu Trp Ser Leu His Lys 

75 80 85 

ata tat tta tgt caa tct ggc tac ttt tct agt atg ttc agt ggt tct 342 
He Tyr Leu Cys Gin Ser Gly Tyr Phe Ser Ser Met Phe Ser Gly Ser 
90 95 100 

tgg aaa gaa tec age atg aat att att gaa ctg gag att cct gac cag 390 
Trp Lys Glu Ser Ser Met Asn He He Glu Leu Glu He Pro Asp Gin 
105 110 115 

aac att gat gta gaa gca ctg cag gtt gca ttt ggt tea ctg tat cga 438 
Asn He Asp Val Glu Ala Leu Gin Val Ala Phe Gly Ser Leu Tyr Arg 
120 125 130 135 

gat gat gtc ttg ata aag ccc agt cga gtt gtt gee att ttg gca gca 486 
Asp Asp Val Leu He Lys Pro Ser Arg Val Val Ala He Leu Ala Ala 

140 145 150 

get tgt ttg ctg cag ttg gac ggt tta ata cag cag tgt ggt gag aca 534 
Ala Cys Leu Leu Gin Leu Asp Gly Leu He Gin Gin Cys Gly Glu Thr 

155 160 165 

atg aag gaa aca gtt aat gtg aaa act gta tgt ggc tat tac aca tea 582 
Met Lys Glu Thr Val Asn Val Lys Thr Val Cys Gly Tyr Tyr Thr Ser 
170 175 180 

gca ggg ace tat gga tta gat tct gta aag aaa aag tgc ctt gaa tgg 630 
Ala Gly Thr Tyr Gly Leu Asp Ser Val Lys Lys Lys Cys Leu Glu Trp 
185 190 195 

ctt eta aac aat ttg atg act cac cag aat gtt gaa ctt ttt aaa gaa 678 
Leu Leu Asn Asn Leu Met Thr His Gin Asn Val Glu Leu Phe Lys Glu 
200 205 210 215 

etc agt ata aat gtc atg aaa cag etc att ggt tea tct aac tta ttt 
Leu Ser He Asn Val Met Lys Gin Leu He Gly Ser Ser Asn Leu Phe 

220 225 230 
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gtg atg caa gtg gag atg gat ata tac act get eta aaa aag tgg atg 774 

Val Met Gin Val Glu Met Asp He Tyr Thr Ala Leu Lys Lys Trp Met 

235 240 245 

ttc ctt caa ctt gtg cct tct tgg aat gga tct tta aaa cag ctt ttg 822 

Phe Leu Gin Leu Val Pro Ser Trp Asn Gly Ser Leu Lys Gin Leu Leu 

250 255 260 

aca gaa aca gat gtc tgg ttt tct aaa cag agg aaa gat ttt gaa ggt 870 

Thr Glu Thr Asp Val Trp Phe Ser Lys Gin Arg Lys Asp Phe Glu Gly 
265 270 275 

atg gec ttt ctt gaa act gaa caa gga aaa cca ttt gtg tea gta ttc 918 

Met Ala Phe Leu Glu Thr Glu Gin Gly Lys Pro Phe Val Ser Val Phe 
280 285 290 295 

aga cat tta agg tta caa tat att ate agt gat ctg get tct gca aga 966 

Arg His Leu Arg Leu Gin Tyr He He Ser Asp Leu Ala Ser Ala Arg 

300 305 310 



att att 


gaa 


caa gat get gta 


gta 


cct 


tea gaa tgg etc 


tct tct 


gtg 


1014 


He He 


Glu 


Gin Asp Ala Val 


Val 


Pro 


Ser Glu Trp Leu 


Ser Ser 


Val 








315 




320 




325 






tat aaa 


cag 


cag tgg ttt get 


atg 


ctg 


egg gca gaa cag 


gac agt 


gag 


1062 


Tyr Lys 


Gin 


Gin Trp Phe Ala 


Met 


Leu 


Arg Ala Glu Gin 


Asp Ser 


Glu 






330 




335 




340 








gtg ggg 


cct 


caa gaa ate aat 


aaa 


gaa 


gaa eta gag gga 


aac age 


atg 


1110 


Val Gly 


Pro 


Gin Glu He Asn 


Lys 


Glu 


Glu Leu Glu Gly 


Asn Ser 


Met 




345 




350 






355 








agg tgt 


ggt 


aga aag ctt gee 


aaa 


gat 


ggt gaa tac tgc 


tgg cgt 


tgg 


1158 


Arg Cys 


Gly 


Arg Lys Leu Ala 


Lys 


Asp 


Gly Glu Tyr Cys 


Trp Arg 


Trp 




360 




365 






370 




375 




aca ggt 


ttt 


aac ttc ggc ttc 


gac 


eta 


ctt gta act tac 


ace aat 


eg a 


1206 


Thr Gly 


Phe 


Asn Phe Gly Phe 


Asp 


Leu 


Leu Val Thr Tyr 


Thr Asn 


Arg 








380 






385 


390 







tac ate att ttc aaa cgc aat aca ctg aat cag cca tgt age gga tct 
Tyr He He Phe Lys Arg Asn Thr Leu Asn Gin Pro Cys Ser Gly Ser 

395 400 405 



1254 
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qtc aqt 


tta 


cag 


cct 


cqa 


aqg 


age ata 


Val Ser 


Leu 


Gin 


Pro 


Apq 


Apq 


Ser lie 




410 










415 


tct ttt 


qat 


aqt 


aqt 


qqa 


aaa 


eta ata 


Spp Phe 


Asd 


Ser 


Ser 


Gly 


Lvs 


Leu He 


425 










430 




Pad rltr) 


ctt 


aca 


ctt 


aaa 


aaa 


aat caa 


Gin He 


Lpu 


Thr 


Lpu 

i— \A 


Glu 


Lvs 


Asd Gin 

f AO Ls U Ml 


440 








445 






gac age 


agg 


ctt 


ctg 


ate 


ttc 


cct tta 


Asp Ser 


Arg 


1 Pii 


i pu 

1— C 


lie 


Phe 

i I it 


PPD 1 PU 








460 








tat ata 


tea 


cca 


gaa 


aaa 


aag 


aa 


Tyr He 


Ser 


Pro 


Glu 


Lys 


Lys 








475 










<210> 


2 










<211> 


478 










<212> 


PRT 










<213> 


Homo sapiens 




<400> 


2 










Met Arg 


Arg 


Ala 


Thr 


Asp 


Ser 


Val Thp 


1 






5 








Lys Arg 


Ser 


Ser 


Gly 


Ser 


Phe 


Cys Typ 






20 








25 


Asp Glu 


Asp 


Glu 


Glu 


Glu 


Gly 


Asp Glu 




35 










40 


Pro Arg 


Arg 


Lys 


Lys 


Leu 


Lys 


Sep Thp 


50 










55 




Leu Phe 


Leu 


Asn 


Gly 


Glu 


Asn 


Sep Asp 


65 








70 






Glu Glu 


Trp 


Ser 


Leu 


His 


Lys 


lie Tyr 








85 








Ser Ser 


Met 


Phe 


Ser 


Gly 


Ser 


Tpp Lys 






100 








105 


Glu Leu 


Glu 


He 


Pro 


Asp 


Gin 


Asn He 




115 










120 


Ala Phe 


Gly 


Ser 


Leu 


Tyr 


Arg 


Asp Asp 


130 










135 





gca ttt aga tta cgt ttg get 1302 
Ala Phe Apg Leu Apg Leu Ala 

420 

tgt agt aga aca act ggc tat 1350 
Cys Sep Apg Thp Thp Gly Typ 
435 

gaa caa gtg gtg atg aac ttg 1398 
Glu Gin Val Val Met Asn Leu 
450 455 

tat ate tgc tgt aac ttc ttg 1446 
Typ He Cys Cys Asn Phe Leu 
465 470 

1469 



Val Arg Gly 


Ser His Lys 


Arg 


10 


15 




Cys His Pro 


Asp Ser Glu 


Thr 




30 




Gin Gin Arg 


Leu Leu Asn 


Thr 




45 




Ser Lys Tyr 


He Tyr Gin 


Thr 


60 






He Lys lie 


Cys Ala Leu 


Gly 


75 




80 


Leu Cys Gin 


Ser Gly Tyr 


Phe 


90 


95 




Glu Ser Ser 


Met Asn He 


He 




110 




Asp Val Glu 


Ala Leu Gin 


Val 




125 




Val Leu lie 


Lys Pro Ser 


Arg 


140 
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Val 

V U 1 


Val 

V U 1 


Ala 

/> 1 u 


He 

J> 1 t 


145 








He 


Gin 


Gin 


Cvs 


Val 


Cys 


Gly 


Tyr 








180 


Lvs 


Lvs 
y 


Lvs 
i-j' -o 


Cvs 

Vv Jr O 






195 




Asn 

no i i 


Val 

V (J 1 


Glu 

VJ 1 14 


Leu 

l_ V> M 




21 n 






Hp 

1 1 c 


Glv 
u ty 


OCI 


OCI 


LCD 








Thr 
1 1 II 


Ala 

rA 1 Q 


1 PM 


1 vc 
Lyo 


Gly 


Ser 


Leu 


Lys 








260 


Gin 


Am 
mi y 


1 VS 


Asn 

nop 






275 

CIO 




1 VC 
LjO 


Pro 

r i u 


Phe 

r i it 


Val 

V □ 1 




290 






Spr 


Asn 


1 PIJ 

L- t VJ 


Ala 


JUJ 








Spp 

JCI 


Glu 

VJ 1 u 


Trn 

1 1 ij 


1 PI J 


Arg 


Ala 


Glu 


Gin 








340 


61 u 

VJ 1 U 


Leu 

l— V— U 


Glu 


Glv 

VJ 1 J 






355 




61 v 

vj i jr 


Glu 

VJ 1 U 


Tvr 
• j i 


Cvs 




370 






i PIJ 


Val 

V U 1 




Tvr 
i ji 


385 








Asn 


Gin 


Pro 


Cvs 

Vvjr o 


Ala 


Phe 


Arg 


Leu 








420 


Cys 


Ser 


Arg 


Thr 






435 




Glu 


Gin 


Val 


Val 




450 






Tyr 


He 


Cys 


Cys 



465 

<210> 3 



Leu 

1— V* VJ 


Ala 


Ala 


Ala 




150 






Glv 


Glu 


Thr 


Met 


165 








Tyr 


Thr 


Ser 


Ala 


Leu 


Glu 


Trp 


Leu 








200 


Phe 


1 VS 


Glu 


1 PIJ 

LCU 






215 




Acn 

nol 1 


1 PI 1 


Phe 

r i ic 


Val 

v a i 










1 VS 
LjrO 


Trn 
1 1 [j 


Mpt 

1 IC L 


Phe 

r i ic 


245 

£-*TvJ 








Gin 


1 pi i 

LCU 


1 PIJ 
LCU 


Thr 
1 1 1 1 


Phe 


Glu 


Gly 


Met 








280 


Spr 


Val 

V U 1 


Phe 

1 I It 


Ara 

m y 






295 

CJO 




Ser 


Ala 

1 \ 1 u 


Ara 

nl y 


He 




310 






Ser 


Ser 


Val 

V VJ 1 


Tvr 
i j i 


325 








Asn 

/\0 VJ 


Ser 


Glu 

VJ 1 LA 


Val 


Asn 


Ser 


Met 


Arg 








360 


Tro 


Arc? 


Tro 


Thr 

1 1 1 1 






375 




Thr 


Asn 

f \ w 1 1 


Ara 


Tvr 




390 






Ser 


Gly 

vj i Jr 


Ser 


Val 










Arg 


Leu 


Ala 


Ser 


Thr 


Gly 


Tyr 


Gin 








440 


Met 


Asn 


Leu 


Asp 






455 




Asn 


Phe 


Leu 


Tyr 




470 







Cvs 

v> jr o 


Leu 


I pij 

l_ t VJ 


Gin 

VJ 1 1 1 






155 




Lvs 


Glu 

VJ 1 U 


Thr 
1 1 1 1 


Val 

V U 1 




170 

-L / VJ 






Glv 


Thr 


Tvr 


Glv 


185 








Leu 

l_ t V4 


Asn 


Asn 


1 PIJ 

l_ t u 


Ser 


He 


Asn 


Val 








220 


Mpt 

1 IC u 


Gin 

VJ 1 1 1 


Val 

V Q 1 


U 1 LI 










1 PIJ 
LCU 


Gin 

VJ I 1 1 


1 PM 

LCU 


Val 

v a i 










Glu 

VJ 1 u 


Thr 
1 1 1 1 


Asn 


Val 

V U 1 










Ala 


Php 

r i ic 


1 PI 1 
LCU 


Rlu 

u 1 U 


His 


Leu 


Arg 


Leu 








300 


He 

1 1 t 


Glu 

vj i u 


Gin 

VJ II 1 


Asn 










1 VS 


Gin 

U I 1 1 


Gin 

Hill 


Trn 
1 1 y 




330 






Glv 


Pro 


Gin 

VJ 1 II 


Glu 

VJ 1 u 


345 








Cvs 


Glv 


Ara 

ni y 


I VS 


Gly 


Phe 


Asn 


Phe 








380 


He 


He 

j. 1 t 


Phe 


Lvs 






395 

\J -S \J 




Cap 
JCI 


1 Ol 1 

LcU 


Rln 
u 1 f 1 


Prn 
rl U 




410 






Phe 


Asp 


Ser 


Ser 


425 








He 


Leu 


Thr 


Leu 


Ser 


Arg 


Leu 


Leu 








460 


He 


Ser 


Pro 


Glu 



475 



Leu 


Asp 


Gly 


Leu 








160 


Asn 


Val 


Lvs 

i~J o 


Thr 

i 1 1 1 






175 




Lpu 

L_ t L4 


Asn 


Spr 

JC 1 


Val 

V VJ 1 




190 

X ~J u 






Met 


Thr 
1 1 ii 


His 

1 1 1 o 


Gin 

VJ 1 1 ■ 


205 








Mpt 

1 IC L 


1 VQ 


Rln 

U t 1 1 


1 PI 1 
LCU 


Met 


Asp 


He 


Tyr 








240 


r 1 u 


Sop 
Oci 


Trn 

i rp 


Acn 
Moi I 










i r p 


Php 
n ic 


Sop 

OCI 






CI u 






Thr 

III! 


Rl u 

U 1 u 


Rln 

U 1 II 


Rl v 
u i y 










Gin 


Tvr 


Hp 

1 1 c 


Tie 

1 1 c 


Ala 


Val 


Val 


Pro 








320 


Php 

r i ic 


r\ i a 


Mpt 

1 IC i* 


1 PM 
LCU 






335 




He 


Asn 

i \<j i i 


Lvs 


Glu 

VJ 1 VJ 




350 






Leu 


Ala 


Lvs 


Asd 


365 








Gly 

VJ 1 J 


Phe 
ill 


Asn 


Leu 

1— t VJ 


Mr y 


Moll 


i nr 


1 d 1 1 

LcU 








400 


Arg 


Arg 


Ser 


He 






415 




Gly 


Lys 


Leu 


He 




430 






Glu 


Lys 


Asp 


Gin 


445 








He 


Phe 


Pro 


Leu 



Lys Lys 
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<211> 569 
<212> PRT 

<213> Drosophila melanogaster 



<400> 3 



ricL 


Gl V 

u ly 


Gin 


Tip 

I 1 c 


Val 

Vd 1 


Gl v 
u ly 


*\pr 

JCl 


Mpf 

1 IC L 


Hi<; Mpt A«;n Val Ala 


Gin Val 
u i u v d i 


Php 
ri le 


i 
i 
















in 


1R 




Cop 


Acn 
Mol 1 


Arn 
Ml y 


Arn 


1 v/c 


Arn 
Mi y 


1 uc 

Lyo 


Arn 
mi y 


Qflp Thr Acn Qor Cor 

JCl Mil MO[J JCl JCl 


1 pii Gl v 
LcU u i y 


1 vc 
Lyo 


















CO 


ou 




A cn 
Mop 


Acn 
Mop 


Prn 
ri U 


M 1 a 


Gin 
u 1 1 1 


1 pi i 
LcU 


Acn 
Mop 


Thr 
1 1 1 1 


Thr Gin Prn 1 vc 1 vc 
nil u in r i u Lyo Lyo 


1 \/C 1 vc 

Lyo Lyo 


1 Ol I 

LcU 






7R 










40 


4S 

*T J 






! Pt i 
LcU 


Thr 


Thr 


Thr 
1 1 1 1 


Gin 

U 1 1 1 


Tvr 


Tip 

1 1 c 


T\/r 

iyr 


1 vc Ala 1 pi i Php 1 \/q 
LjO Mid LCU rile LjrO 


Gin Gin 


1 vc 

Lys 




D\J 










jj 




nO 






Acn 
Moll 


Cop 


Acn 
Mop 


Val 


Al A 

M I d 


Val 
vd i 


Mpf 
1 lc L 


Ala 
M I d 


1 on Acn 1 vc Val Trn 
LcU Mop Lyo vd l lip 


Mi c 1 pi i 
n \ o LcU 


Mi c 
n i o 


U J 










/ u 






/ j 




RO 


1 \/c 

Lyo 


Val 
Vd 1 




i pi i 
LcU 


^p r 

JCl 


Gin 

U 1 1 1 


Cor 
JCl 


Prn 
r i u 


Tvr Php Tvr Thr Mpf 
i y i r i ic i y i 1 1 ii i ic L 


Php Acn 
rile Moil 


Gl v 
vj ly 










OD 










j j 




Thr 
i ill 


Trn 

irp 


Arn 
Ml y 


Gill 


Ala 
M I d 


Gin 

u 1 1 1 


Gin 


Acn 

MOI 1 


Php T1p Gin T1p Thr 
nic lie uiii lie nil 


Tip 1 pii 

l 1 c LeU 


Acn 
Mop 








i no 

1UVJ 










1 0^ 


1 1 0 




Acn 
Mop 


Arn 
Mi y 


Tip 

1 t c 


Thr 
1 1 1 1 


Val 


Ala 
M i d 


^pr 

JCl 


i PI I 
LCU 


Acn Ala Val Php Gl v 

r\OU nlu VQ 1 rllC Vj 1 Jr 


JCl I IC L 


Tvr 

•y' 






11R 

1 -L J 










120 


125 

J. C_ J 






JCI 


Acn 

AAO p 


Glu 


Tip 

1 1 c 


Glu 

Vj 1 U 


Tip 

1 1 c 


Glu 

VJ I u 


Spr 

JC 1 


Ala Acn Val Tip ^pr 

t \ i u no l/ v a i lie jci 


Val I pu 

V U 1 LCU 


Ala 

AA 1 U 




130 










135 




140 






Thr 
1 1 1 1 


Ala 


Thr 


1 PI J 


Php 

r i ic 


His 

1 1 1 o 


1 PIJ 


Acn 


Gl v Tip Tip Asn 1 v c 

VJ 1 J 1IC xic r\o y> \—J o 


fv c Ala 


Glu 

VJ 1 \A 


14^ 










150 






155 

U J 




160 

1 VJ Vy 


Val 

v a i 


Mpf 

1 IC u 


Val 

v a i 


Acn 

no yj 


Acn 


Hp 

1 1 c 


Spr 

JCl 


Pro 


Glu Thr Ala He Gin 

VJ 1 VJ lilt n 1 U 1IC VJ 1 1 1 


Tvr Tvr 


Glu 

VJ 1 u 










165 








170 


175 




Ala 


Ala 


Lj o 


Gin 


Tvr 
i ji 


Gl v 


Val 


Val 

V u 1 


Gl v Val 1 vs 1 Spr 


Thr Phe 


Gin 

VJ 1 1 1 








180 

1UU 










185 


190 




irp 


Php 
r lie 


Gin 


T1p 
I 1 c 


Ac r 
Moll 


9 pi i 
LcU 


i PI 1 
LcU 


JCl 


Tip Tvr ^pr 1 vc Gl n 
i ic ijri jci Lyo ulll 


Prn Acn 
r I u Mo 1 1 


1 PI 1 
LcU 






195 










200 


205 






Leu 


Arg 


His 


He 


Ser 


He 


Glu 


Leu 


Met Ser Ala Leu Thr 


Ala Ser 


Pro 




210 










215 




220 






Asp 


Leu 


Tyr 


Val 


Met 


Gin 


Thr 


Glu 


Phe Ser Leu Tyr Thr 


Leu Leu 


Arg 


225 










230 






235 




240 


Thr 


Trp 


Met 


Phe 


Leu 


Arg 


Leu 


His 


Pro Asp Tyr Asp Pro 


Glu Asp 


Pro 










245 








250 


255 




Val 


Gin 


Arg 


Ala 


Glu 


Ala 


Leu 


Lys 


Thr Gin Glu Leu Leu 


Val Asn 


Ala 








260 










265 


270 




Gly 


Val 


Glu 


Thr 


His 


Ala 


Pro 


Ser 


Gly Asp Val Val Gin 


Trp Thr 


Tyr 






275 










280 


285 






Phe 


Thr 


Ser 


Arg 


Ser 


Glu 


Glu 


Arg 


Ser Phe Leu Ala Thr 


Pro Glu 


Gly 




290 










295 




300 
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PI n 

b i n 


riO 


iyr vd i Lyb vd i rne 


PI n 1 \/c 1 on A rrt "Thr* PI n T\/n 

bin Lyb Leu t\i y inr bin iyr 


1 oi i Th n 

Leu inr 


ouo 








\JL-U 


Asn 


Hi c 
HI 5 


T\/n Mq+" Acn 1 on 1 \/c 

iyr net Mbp Leu Lyb 


Tlo. Tlo Tv/r* Acn Acn Acn Tlo 
lie lie iyr HbM Mbp Mbil lie 


lie ri U 










OOO 


Lys 


pi 1 1 

b 1 U 


I rp Leu iyr Ary nib 


' 1 a LH c Acn LH c Tnn Acn Al a 

lie nib Mbn ms i rp Asp Mia 


1 Ql 1 1 OI 1 

LcU LGU 






qad 

04U 


^ac ^cn 

040 OOU 




Arg 


Tl o 

i le 


A cn Ui c PI w Pin Pin 

Asp MIS biy bin blU 


Ar a P v / f C" Pi y» PI va 1 m i 

Asp uys oer rro bin bin Leu 


A r n Acn 

ASp ASp 






OCC 
ODD 


ocn Q£C 

OOU OOO 




pi 1 1 


pi m 

bin 


nt-kr\ Dl-in Plii Acr*» Pv/r 

rne Kne biu Asn Lys 


riet Arg Lys biy Arg net. Leu 


1 am Pill 

Leu blU 




o/U 


07C 

o/o 


OQH 
OOU 




Hro 


pi w 

b ly 


iyr bin Lys irp Arg 


T r « ~VV\ PI \/ Dkft AriA Daa PI w 

i rp inr biy rne Asn rne biy 


net Asp 


OQC 
OOO 




oon 


jyo 


Ann 

4UU 


Leu 


i le 


Leu lie riet Asp oer 


Arg Arg Leu Asn lie Arg Arg 


HIS HIS 






/i ric 

4Uo 


>ii n 
41U 


410 


Arg 


HIS 


b lu Hi s biu Arg va I 


Leu oer Leu bin inr Lys Arg 


Lys rne 






a on 


/ioc /ion 
4^0 4oU 




U -.4- 

riet 


va i 


Arg i nr i nr va i i nr 


C Tin Arm A 1 -\ Plk-k A r»n PlkA 

oer lie Asn Aia bin Arg bin 


Al n Wal 

a i a va 1 






/IOC 

4oo 


44U 440 




rne 


I nr 


p"l« Thr* Ca^ Plii Tin 

bin i nr oer biu lie 


Lys oer Leu oer Leu b i u Lys 


Acn Plii 

Asn blU 




40U 


ACC 
4DD 


AAH 
40U 




pi 1 1 

b 1 U 


Vd 1 


Kro Leu net va i Leu 


Asp rro Lys Leu vai nis rro 


1 OI 1 1 OI 1 

Leu Leu 


400 




A1C\ 
4/ u 


4/0 


40U 


1 1 e 


C /-\ f> 

oer 


Tin Acvn Mcf 1 ai i \/ n 1 

lie Asn net Leu vai 


\/-3 "] Mq+ Drn Drn Acn Pin Qor 

va i net rro rro asm bin oer 


Dho 1 \/c 

rne Lyb 






arc; 

400 


AQH 


AQR 

*+370 


Glu 


He 


Vai Pro Leu Ser Glu 


Glu Ala Thr Thr Ser Leu Ser 


He Pro 






500 


505 510 




He 


Ser 


Glu He Gly Ala Asn 


Ser Asp Arg Pro Leu Ser Pro 


Ser Ser 






515 


520 525 




Ala 


Asp 


Asp Ser Ala Vai Phe 


He Gly Asp Ser Glu Pro Ser 


Thr Pro 




530 


535 


540 




Ser 


Ser 


Pro Ala Pro Arg Pro 


Arg He Ala Trp Ser Ala Ser 


Glu Thr 


545 




550 


555 


560 


Gly 


Ala 


He Cys Gly Gin Leu 


Ala Cys 





565 

<210> 4 
<211> 1437 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Degenerate nucleotide sequence encoding ZGCL-1 
polypeptide of SEQ ID NO: 2 
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<400> 4 

atgmgnmgng cnacngayws ngtnacngtn mgnggnwsnc ayaarmgnaa rmgnwsnwsn *60 

ggnwsnttyt gytaytgyca yccngaywsn garacngayg argaygarga rgarggngay 120 

garcarcarm gnytnytnaa yacnccnmgn mgnaaraary tnaarwsnac nwsnaartay 180 

athtaycara cnytnttyyt naayggngar aaywsngaya thaarathtg ygcnytnggn 240 

gargartggw snytncayaa rathtayytn tgycarwsng gntayttyws nwsnatgtty 300 

wsnggnwsnt ggaargarws nwsnatgaay athathgary tngarathcc ngaycaraay 360 

athgaygtng argcnytnca rgtngcntty ggnwsnytnt aymgngayga ygtnytnath 420 

aarccnwsnm gngtngtngc nathytngcn gcngcntgyy tnytncaryt ngayggnytn 480 

athcarcart gyggngarac natgaargar acngtnaayg tnaaracngt ntgyggntay 540 

tayacnwsng cnggnacnta yggnytngay wsngtnaara araartgyyt ngartggytn 600 

ytnaayaayy tnatgacnca ycaraaygtn garytnttya argarytnws nathaaygtn 660 

atgaarcary tnathggnws nwsnaayytn ttygtnatgc argtngarat ggayathtay 720 

acngcnytna araartggat gttyytncar ytngtnccnw sntggaaygg nwsnytnaar 780 

carytnytna cngaracnga ygtntggtty wsnaarcarm gnaargaytt ygarggnatg 840 

gcnttyytng aracngarca rggnaarccn ttygtnwsng tnttymgnca yytnmgnytn 900 

cartayatha thwsngayyt ngcnwsngcn mgnathathg arcargaygc ngtngtnccn 960 

wsngartggy tnwsnwsngt ntayaarcar cartggttyg cnatgytnmg ngcngarcar 1020 

gaywsngarg tnggnccnca rgarathaay aargargary tngarggnaa ywsnatgmgn 1080 

tgyggnmgna arytngcnaa rgayggngar taytgytggm gntggacngg nttyaaytty 1140 

ggnttygayy tnytngtnac ntayacnaay mgntayatha thttyaarmg naayacnytn 1200 

aaycarccnt gywsnggnws ngtnwsnytn carccnmgnm gnwsnathgc nttymgnytn 1260 

mgnytngcnw snttygayws nwsnggnaar ytnathtgyw snmgnacnac nggntaycar 1320 

athytnacny tngaraarga ycargarcar gtngtnatga ayytngayws nmgnytnytn 1380 

athttyccny tntayathtg ytgyaaytty ytntayathw snccngaraa raaraar 1437 

<210> 5 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide ZC12991 
<400> 5 

ctgctgtatt aaaccgtcca actg 24 

<210> 6 
<211> 24 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Oligonucleotide ZC12992 
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<400> 6 
ctagtatgtt cagtggttct tgaa 



24 



<210> 7 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide ZC694 
<400> 7 

taatacgact cactataggg 20 

<210> 8 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide ZC1681 



<210> 9 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide ZC14122 
<400> 9 

cagtattcag acatttaacc 20 

<210> 10 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide ZC14183 
<400> 10 



<400> 8 
gaataagagt atagaaga 
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gaaggtacta cagcatcttg 20 

<210> 11 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide ZC14184 
<400> 11 

aagtggtgat gaacttggac 20 

<210> 12 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 01 i nucleotide ZC14237 
<400> 12 

gagatcgcac cattgcactc 20 

<210> 13 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide ZC14238 
<400> 13 

gctactgttt ccaactgatc 20 

<210> 14 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide ZC14284 
<400> 14 

gtgcgatctc ggctcactgc 20 
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<210> 15 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide ZC14345 
<400> 15 

atgtcgtctt ggagatcggg 20 

<210> 16 
<211> 200 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Northern probe 
<400> 16 

ctagtatgtt cagtggttct tggaaagaat ccagcatgaa tattattgaa ctggagattc 60 
ctgaccagaa cattgatgta gaagcactgc aggttgcatt tggttcactg tatcgagatg 120 
atgtcttgat aaagcccagt cgagttgttg ccattttggc agcagcttgt ttgctgcagt 180 
tggacggttt aatacagcag 200 
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